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Directional tag PCR substractive hybridization was used to construct a rat hypothalamic cDNA library from which cerebellar 
and hippocampal sequences had been depleted. Hypocretin, one of several novel hypothalamic-specific polypeptides identified, isolated 
and sequenced, is localized to regions of the hypothalamus involved in appetite and feeding behavior. Hypocretin polypeptides are 
biologically active, producing electrical changes in neurons, lowering body temperature and reducing food intake. The invention provides 
hypocretin polynucleotides and hypocretin polypeptides as well as antibodies, oligonucleotides, diagnostic kits and methods, and therapeutic 
compositions and methods. 
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HYPOTHALAMUS-SPECIFIC POLYPEPTIDES 

Reference to Related Application 

This application claims the benefit of U.S. Provisional Application 
S.N. 60/023,220, filed August 2, 1996, which is explicitly incorporated by 
reference, as are all references cited herein. 
Governmental Rights 

This invention was made with governmental support from the 
United States Government, National Institutes of Health, Grants GM32355 and 
NS33396; the United States Government has certain rights in the invention. 

Field of the Invention 

This invention relates to the identification, isolation, sequencing, 
use, and expression of hypothalamus-specific proteins and fragments thereof. 
Background of the Invention 

The hypothalamus, a phylogenetically ancient region of the 
mammalian brain, is responsible for the integration of the central nervous system 
and the endocrine system and is particularly related to the physiological response 
to stress. In contrast to laminar cortical structures such as the cerebellum and 
hippocampus whose final functions rely on innervation from the thalamus and 
brain stem, the hypothalamus is organized as a collection of distinct, 
autonomously active nuclei with discrete functions. Ablation and electrical 
stimulation studies and medical malfunctions have implicated several of these 
nuclei as central regulatory centers for major autonomic and endocrine 
homeostatic systems mediating processes such as reproduction, lactation, fluid 
balance, metabolism, and aspects of behaviors, such as circadian rhythmicity, 
basic emotions, feeding and drinking, mating activities, and responses to stress, 
as well as normal development of the immune system (Shepherd, G.M., 
Neurobiology, 3rd ed. Oxford University Press, New York, 1994). Distinct 
hormones and releasing factors have been associated with some of these nuclei 
but, at best, the organizations and molecular operations of these structures are 
only partially understood. 
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A substantial portion of a mammal's genetic endowment is 
dedicated to the function of its central nervous system, as evidenced by the 
substantial number of mRNAs selectively expressed in the brain (Sutcliffe, J.G., 
Ann. Rev. Neurosci. 11:157-198, 1988). Many of these have been observed to be 
5 selectively associated with distinct neural subsets. Existing knowledge of the 
expression of specific hypothalamic hormones and releasing factors suggests that 
ensembles of mRNAs selectively associated with discrete hypothalamic nuclei 
may encode proteins singularly associated with the unique functions of those 
nuclei. 

10 Summary of the Invention 

The present invention provides peptides and polypeptides found in 
the hypothalamus region of the mammalian brain. Preferably, the peptides and 
polypeptides are enriched in the hypothalamus relative to other regions of the 
brain. More preferably the peptides and polypeptides are specific to the 

15 hypothalamus. One embodiment is the rat polypeptide hypocretin also referred to 
as, H35 protein or clone 35 protein (SEQ ID NO:l) and polypeptide analogs 
thereof having at least one conservative amino acid substitution. Another 
embodiment is the mouse hypocretin polypeptide (SEQ ID NO: 2) and 
polypeptide analogs thereof having at least one conservative amino acid 

20 substitution. 

The present invention also provides polynucleotides encoding 
peptides and polypeptides found in the hypothalamus region of the brain. 
Preferably, the polynucleotides encoding peptides and polypeptides are enriched 
in the hypothalamus relative to other regions of the brain. More preferably the 

25 polynucleotides encoding peptides and polypeptides are specific to the 

hypothalamus. One embodiment is a polynucleotide chosen from the group 
consisting of the polynucleotide of SEQ ID NO:3, a polynucleotide having at 
least about 95 % of its nucleotide sequence identical to the polynucleotide of SEQ 
ID NO: 3, and polynucleotides hybridizing to the polynucleotide of SEQ ID NO: 

30 3. Another embodiment is a polynucleotide chosen from the group consisting of 
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the polynucleotide of SEQ ID NO:4, a polynucleotide having at least about 95% 
of its nucleotide sequence identical to the polynucleotide of SEQ ID NO: 3, and 
polynucleotides hybridizing to the polynucleotide of SEQ ID NO: 4. 

Also provided are vectors for the expression of the novel 
5 polynucleotides operably linked to control sequences capable of directing the 
production of the novel polypeptides in suitable host cells. 

In other aspects this invention provides pharmaceutical 
compositions of the polynucleotides, polypeptides and peptides, antibodies to the 
peptides and polypeptides as well as compositions thereof. This invention also 
10 provides assay methods and kits for practicing the methods, and methods for 
using the polynucleotides, peptides and polypeptides for diagnostic and 
therapeutic purposes. 
Brief Description of the Drawings 
In the Drawings, 

15 Fig. 1 shows the results of subtractive screening, enriched for 

sequences selectively expressed in hypothalamus. Replicate dot blots on which 
the indicated masses of plasmid DNA for clones of neuron-specific enolase 
(NSE), cyclophilin, proopiomelanocortin (POMC), vasopressin, the vector 
pT7T3D, protein kinase C5 (PKC5) and growth hormone (GH) were manually 

20 spotted and hybridized with cDNA probes made from cRNA transcribed from the 
target or subtracted libraries, or an equal mixture of the cerebellum and 
hippocampus driver libraries. Comparison of the signal intensities for the 
vasopressin dilution series dots at several levels of autoradiographic exposure 
suggested a 20-to-30 fold increase in the specific activity of vasopressin cDNA. 

25 Fig. 2. shows the results of cDNA library Southern blotting with 

clones representative of the four distribution classes. The electrophoretic lanes 
contain the cerebellum first driver library (Dl), the hippocampus second driver 
library (D2), and the hypothalamus target library (T) cleaved with Haelll and 
hybridized with the inserts from clone 35 (Panel A), clone 10 (Panel B), clone 86 

30 (Panel C) and clone 19 (Panel D). 
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Fig. 3 distribution of hypothalamic mRNAs. Northern blots with 
poly(A) + RNA isolated from extracts of whole brain, olfactory bulb, cerebral 
cortex, hippocampus, hypothalamus, thalamus, cerebellum, pituitary, liver, 
kidney and heart were probed with cDNA inserts from the indicated clones. A 
cyclophilin probe was included in the series as a control for comparable blot 
loading and RNA integrity. The two hypothalamus samples represent inadvertent 
mixtures of approximately equal parts of hypothalamus and striatum. The 
expression patterns are grouped into four classes (A,B,C,D). Only the regions of 
the blots containing the hybridized signal are shown. 

Fig. 4 depicts the expression patterns analyzed by in situ 
hybridization, showing coronal sections of rat brains hybridized with single 
stranded RNA probes corresponding to the inserts of A, clone 35; B, clone 6; C, 
clone 10; D, clone 20; E, clone 29 and F, clone 21. 

Fig. 5 shows a comparison of rat and mouse cDNA and amino acid 
sequences corresponding to clone 35 and the amino acid sequence of the peptide 
hormone secretin. A. The amino acid sequence is listed on the top line, the rat 
nucleotide sequence on the second line and the mouse nucleotide sequence is 
listed on the third line. Differences in nucleotide sequences are indicated by 
asterisks below each different base, amino acid differences are indicated by 
alternatives (rat/mouse) listed above the encoding triplets. Tandem basic amino 
acids (putative sites for proteolytic maturation) are indicated in bold italics, as is 
the serine residue most likely to represent the end of the secretion signal. 
B. Alignment of hcrtl and hcrt2 amino acid sequences with the amino acid 
sequence of secretin. The first 9 amino acid residues of secretin have been 
repeated to indicate apparent circular permutation. The identities between the 
hypocretins and members of the glucagon/vasoactive intestinal 
polypeptide/secretin family (H.-C.Fehmann, R. Goke, B. Goke, Endocrine 
Reviews, 16, 390 (1995)) are indicated by asterisks; the hcrtl and hcrt2 
consensus residues appear above the alignment. 

Fig. 6 shows the cDNA and amino acid sequence of clone 29. 
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Fig. 7 is a graphical representation of the results of voltage clamp 
experiments on isolated in vitro rat hypothalamic cells, in which application of 
1 fig hcrt2 produced electrical responses in adult but not immature neurons. 
Detailed Description of Preferred Embodiments 
5 The following definitions are set forth to illustrate and define the 

meaning and scope of the various terms used to describe the invention herein. 
All patents and other publications mentioned in this specification are expressly 
incorporated by reference herein. 

A. Definitions 

10 Amino Acid Residue : An amino acid formed upon 

chemical digestion (hydrolysis) of a polypeptide at its peptide linkages. The 
amino acid residues described herein are preferably in the "L M isomeric form. 
However, residues in the ,r D" isomeric form can be substituted for any L-amino 
acid residue, as long as the desired functional property is retained by the 

15 polypeptide. NH 2 refers to the free amino group present at the amino terminus of 
a polypeptide. COOH refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. The standard polypeptide nomenclature 
(described in J. Biol. Chem. . 243:3552-59 (1969) and adopted at 37 CFR 
§ 1.822(b)(2)) that provides one letter and three letter codes for amino acid 

20 residues is used. 

It should be noted that all amino acid residue sequences 
represented herein by formulae have a left- to-right orientation in the conventional 
direction of amino terminus to carboxy terminus. In addition, the phrase "amino 
acid residue" is broadly defined to include modified and unusual amino acids, 

25 such as those listed in 37 CFR 1.822(b)(4), and incorporated herein by reference. 
Furthermore, it should be noted that a dash at the beginning or end of an amino 
acid residue sequence indicates a peptide bond to a further sequence of one or 
more amino acid residues or a covalent bond to an amino-terminal group such as 
NH 2 or acetyl or to a carboxy-terminal group such as COOH. 

30 Recombinant DNA molecule : a DNA molecule produced by 
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operatively linking two DNA segments. Thus, a recombinant DNA molecule is a 
hybrid DNA molecule comprising at least two nucleotide sequences not normally 
found together in nature. 

Receptor : A receptor is a molecule, such as a protein, 
glycoprotein and the like, that can specifically (non-randomly) bind to another 
molecule. 

Antibody : The term antibody in its various grammatical forms is 
used herein to refer to immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules that contain an antibody 
combining site or paratope. Exemplary antibody molecules are intact 
immunoglobulin molecules, substantially intact immunoglobulin molecules and 
portions of an immunoglobulin molecule, including those portions known in the 
art as Fab, Fab', and F(ab') 2 . 

Antibody Combining Site : An antibody combining site is that 
structural portion of an antibody molecule comprised of a heavy and light chain 
variable and hypervariable regions that specifically binds (immunoreacts with) an 
antigen. The term immunoreact in its various forms means specific binding 
between an antigenic determinant-containing molecule and a molecule containing 
an antibody combining site such as a whole antibody molecule or a portion 
thereof. 

Monoclonal Antibody : A monoclonal antibody in its various 
grammatical forms refers to a population of antibody molecules that contain only 
one species of antibody combining site capable of immunoreacting with a 
particular epitope. A monoclonal antibody thus typically displays a single 
binding affinity for any epitope with which it immunoreacts. A monoclonal 
antibody may therefore contain an antibody molecule having a plurality of 
antibody combining sites, each immunospecific for a different epitope, e.g., a 
bispecific monoclonal antibody. 

Upstream : In the direction opposite to the direction of DNA 
transcription, and therefore going from 5' to 3' on the non-coding strand, or 3' to 
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5' on the mRNA. 

Downstream : Further along a DNA sequence in the direction of 
sequence transcription or read out, that is traveling in a 3'- to 5 '-direction along 
the non-coding strand of the DNA or 5 9 - to 3'-direction along the RNA 
5 transcript. 

Polypeptide : A linear series of amino acid residues connected to 
one another by peptide bonds between the alpha-amino group and carboxy group 
of contiguous amino acid residues. 

Protein : A linear series of more than 50 amino acid residues 

10 connected one to the other as in a polypeptide. 

Substantially Purified or Isolated : When used in the context of 
polypeptides or proteins, the terms describe those molecules that have been 
separated from components that naturally accompany them. Typically, a 
monomeric protein is substantially pure when at least about 60% to 75% of a 

15 sample exhibits a single polypeptide backbone. Minor variants or chemical 
modifications typically share the same polypeptide sequence. A substantially 
purified protein will typically comprise over about 85% to 90% of a protein 
sample, more usually about 95%, and preferably will be over about 99% pure. 
Protein or polypeptide purity or homogeneity may be indicated by a number of 

20 means well known in the art, such as polyacrylamide gel electrophoresis of a 

sample, followed by visualization thereof by staining. For certain purposes, high 
resolution is needed and high performance liquid chromatography (HPLC) or a 
similar means for purification utilized. 

Synthetic Peptide : A chemically produced chain of amino acid 

25 residues linked together by peptide bonds that is free of naturally occurring 
proteins and fragments thereof. 

Nucleic acid or polynucleotide sequence : includes, but is not 
limited to, eucaryotic mRNA, cDNA, genomic DNA, and synthetic DNA and 
RNA sequences, comprising the natural nucleoside bases adenine, guanine, 

30 cytosine, thymidine, and uracil. The term also encompasses sequences having 
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one or more other bases including, but not limited to 4-acetylcytosine, 8-hydroxy- 
N6-methy ladenine, aziridinylcytosine, pseudoisocytosine, 5- 
(carboxyhydroxylmethyl)uracil, 5-fluorouracil, 5-bromouracil, 5- 
carboxymethy laminomethy 1-2-thiouracil , 5 -carboxy methy laminomethy luracil , 
dihydrouracil, inosine, N6-isopentenyl-adenine, 1 -methy ladenine, 1- 
methylpseudouracil, 1 -methy Iguanine, 1 -methy 1-inosine, 2,2-dimethylguanine, 2- 
methy ladenine, 2-methylguanine, 3-methyl-cytosine, 5-methylcytosine, N6- 
methy ladenine, 7-methylguanine, 5-methyl-aminomethy luracil, 5- 
methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'- 
methoxycarbonylmethy luracil, 5-methoxyuracil, 2-methylthio-N6- 
isopenteny ladenine, uracil-5-oxyacetic acid methy lester, uracil-5-oxyacetic acid, 
oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2- 
thiouracil, 4-thiouracil, 5-methy luracil, and 2,6-diaminopurine. 

Coding sequence or open reading frame: a polynucleotide or 
nucleic acid sequence which is transcribed (in the case of DNA) or translated (in 
the case of mRNA) into a polypeptide in vitro or in vivo when placed under the 
control of appropriate regulatory sequences. The boundaries of the coding 
sequence are determined by a translation start codon at the 5' (amino) terminus 
and a translation stop codon at the 3' (carboxy) terminus. A transcription 
termination sequence will usually be located 3* to the coding sequence. 

Nucleic acid control sequences : translational start and stop codons, 
promoter sequences, ribosome binding sites, polyadenylation signals, transcription 
termination sequences, upstream regulatory domains, enhancers, and the like, as 
are necessary and sufficient for the transcription and translation of a given coding 
sequence in a defined host cell. Examples of control sequences suitable for 
eucaryotic cells are promoters, polyadenylation signals, and enhancers. All of 
these control sequences need not be present in a recombinant vector so long as 
those necessary and sufficient for the transcription and translation of the desired 
gene are present. 

Operablv or operativelv linked : the configuration of the coding and 
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control sequences so as to perform the desired function. Thus, control sequences 
operably linked to a coding sequence are capable of effecting the expression of 
the coding sequence. A coding sequence is operably linked to or under the 
control of transcriptional regulatory regions in a cell when DNA polymerase will 
5 bind the promoter sequence and transcribe the coding sequence into mRNA that 
can be translated into the encoded protein. The control sequences need not be 
contiguous with the coding sequence, so long as they function to direct the 
expression thereof. Thus, for example, intervening untranslated yet transcribed 
sequences can be present between a promoter sequence and the coding sequence 
10 and the promoter sequence can still be considered "operably linked" to the coding 
sequence. 

Heterologous and exogenous : as they relate to nucleic acid 
sequences such as coding sequences and control sequences, denote sequences that 
are not normally associated with a region of a recombinant construct, and are not 

15 normally associated with a particular cell. Thus, a "heterologous" region of a 

nucleic acid construct is an identifiable segment of nucleic acid within or attached 
to another nucleic acid molecule that is not found in association with the other 
molecule in nature. For example, a heterologous region of a construct could 
include a coding sequence flanked by sequences not found in association with the 

20 coding sequence in nature. Another example of a heterologous coding sequence 
is a construct where the coding sequence itself is not found in nature (e.g., 
synthetic sequences having codons different from the native gene). Similarly, a 
host cell transformed with a construct which is not normally present in the host 
cell would be considered heterologous for purposes of this invention. 

25 Expression system : polynucleotide sequences containing a desired 

coding sequence and control sequences in operable linkage, so that cells 
transformed with these sequences are capable of producing the encoded product. 
In order to effect transformation, the expression system may be included on a 
discrete vector; however, the relevant polynucleotide may also be integrated into 

30 the host chromosome. 
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Vector : a recombinant polynucleotide comprised of single strand, 
double strand, circular, or supercoiled DNA or RNA. A typical vector may be 
comprised of the following elements operatively linked at appropriate distances 
for allowing functional gene expression: replication origin, promoter, enhancer, 
5' mRNA leader sequence, ribosomal binding site, nucleic acid cassette, 
termination and polyadenylation sites, and selectable marker sequences. One or 
more of these elements may be omitted in specific applications. The nucleic acid 
cassette can include a restriction site for insertion of the nucleic acid sequence to 
be expressed. In a functional vector the nucleic acid cassette contains the nucleic 
acid sequence to be expressed including translation initiation and termination 
sites. An intron optionally may be included in the construct, preferably > 100 
bp and placed 5' to the coding sequence. 

A vector is constructed so that the particular coding sequence is 
located in the vector with the appropriate regulatory sequences, the positioning 
and orientation of the coding sequence with respect to the control sequences being 
such that the coding sequence is transcribed under the "control' 1 of the control 
sequences. Modification of the sequences encoding the particular protein of 
interest may be desirable to achieve this end. For example, in some cases it 
may be necessary to modify the sequence so that it may be attached to the control 
sequences with the appropriate orientation; or to maintain the reading frame. The 
control sequences and other regulatory sequences may be ligated to the coding 
sequence prior to insertion into a vector. Alternatively, the coding sequence can 
be cloned directly into an expression vector which already contains the control 
sequences and an appropriate restriction site which is in reading frame with and 
under regulatory control of the control sequences. 

Suitable marker sequences for identification and isolation of 
correctly transfected cells include the thymidine kinase (tk), dihydrofolate 
reductase (DHFR), and aminoglycoside phosphotransferase (APH) genes. The 
latter imparts resistance to the aminoglycoside antibiotics, such as kanamycin, 
neomycin, and geneticin. These, and other marker genes such as those encoding 
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chloramphenicol acetyltransferase (CAT) and 0-galactosidase (/3-gal), may be 
incorporated into the primary nucleic acid cassette along with the gene expressing 
the desired therapeutic protein, or the selection markers may be contained on 
separate vectors and cotransfected. 
5 The term "biochemically equivalent variations" means protein or 

nucleic acid sequences which differ in some respect from the specific sequences 
disclosed herein, but nonetheless exhibit the same or substantially the same 
functionality. In the case of cDNA, for example, this means that modified 
sequences which contain other nucleic acids than those specifically disclosed are 

10 encompassed, provided that the alternate cDNA encodes mRNA which in turn 
encodes a protein of this invention. Such modifications may involve the 
substitution of only a few nucleic acids, or many. The modifications may involve 
substitution of degenerate coding sequences or replacement of one coding 
sequence with another; introduction of non-natural nucleic acids is included. 

15 Preferably, the modified nucleic acid sequence hybridizes to and is at least 95% 
complementary to the sequence of interest. 

Similarly, in the case of the proteins and polypeptides of this 
invention, alterations in the amino acid sequence which do not affect functionality 
may be made. Such "biochemically equivalent muteins" may involve replacement 

20 of one amino acid with another, use of side chain modified or non-natural amino 
acids, and truncation. The skilled artisan will recognize which sites are most 
amenable to alteration without affecting the basic function. 

The expression products described herein are proteins and 
polypeptides having a defined chemical sequence. However, the precise structure 

25 depends on a number of factors, particularly chemical modifications common to 
proteins. For example, since all proteins contain ionizable amino and carboxyl 
groups, the protein may be obtained in acidic or basic salt form, or in neutral 
form. The primary amino acid sequence may be derivatized using sugar 
molecules (glycosylation) or by other chemical derivatizations involving covalent 

30 or ionic attachment with, for example, lipids, phosphate, acetyl groups and the 
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like, often occurring through association with saccharides. These modifications 
may occur in vitro, or in vivo, the latter being performed by a host cell through 
post-translational processing systems. Such modifications may increase or 
decrease the biological activity of the molecule, and such chemically modified 
molecules are also intended to come within the scope of the invention. 
B. Hvpocretin Proteins and Polypeptides 
Hypocretin or clone H35, has been cloned in both rat and mouse. 
The amino acid residue sequence in these two mammalian species is not identical 
but is sufficiently similar to permit generalization regarding function, and so that 
one can identify and isolate the hypocretin gene in any mammalian species. 

Variations at both the amino acid and nucleotide sequence level are 
described in isolates of hypocretin, and such variations are not to be construed as 
limiting. For example, allelic variation within a mammalian species can tolerate 
a several percent difference between isolates of a type of hypocretin, which 
differences comprise non-deleterious variant amino acid residues. Thus a protein 
of about 95% homology, and preferably at least 98% homology, to a disclosed 
hypocretin is considered to be an allelic variant of the disclosed hypocretin, and 
therefore is considered to be a hypocretin of this invention. 

As disclosed herein, hypocretin is produced first in vivo in 
precursor form, and is then processed into smaller polypeptides having biological 
activity as described herein. Insofar as these different polypeptide forms are 
useful, the term hypocretin protein or polypeptide connotes all species of 
polypeptide having an amino acid residue sequence derived from the hypocretin 
gene. 

The complete coding nucleotide sequence, clone 35, of rat H35 
cDNA is 569 nucleotides in length, and is listed in SEQ ID NO 3. The complete 
preprohypocretin cDNA clone presents a 390 nucleotide open reading frame 
(ORF) plus triplet termination codons (Fig. 5). There is a N-terminal signal 
peptide with a cleavage site between amino acid positions 27 and 28, 
corresponding to a cleavage site after nucleotide position 172 of SEQ ID NO:3. 
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Translation of this rat cDNA sequence produces a novel protein of 
130 amino acid residues, referred to as rat preprohypocretin. The amino acid 
sequence of rat preprohypocretin is listed in SEQ ID NO: 1. The amino acid 
sequence of mouse preprohypocretin is listed in SEQ ID NO: 2. 
5 A hypocretin protein of this invention can be in a variety of forms, 

depending upon the use therefor, as described herein. For example, a hypocretin 
can be isolated from a natural tissue. 

Alternatively, a hypocretin protein of this invention can be a 
recombinant protein, that is, produced by recombinant DNA methods as 

10 described herein. A recombinant hypocretin protein need not necessarily be 

substantially pure, or even isolated, to be useful in certain embodiments, although 
recombinant production methods are a preferred means to produce a source for 
further purification to yield an isolated or substantially pure receptor composition. 
A recombinant hypocretin protein can be present in or on a mammalian cell line 

15 or in crude extracts of a mammalian cell line. 

In one embodiment, a hypocretin protein is substantially free of 
other neuropeptides, so that the purity of a hypocretin reagent, and thus freedom 
from pharmacologically distinct proteins, facilitates use in the screening methods. 
The recombinant production methods are ideally suited to produce significantly 

20 improved purity in this regard, although biochemical purification methods from 
natural sources are also included. In this regard, a hypocretin protein is 
substantially free from other neuropeptides if there are insufficient other 
neuropeptides such that pharmacological cross-reactivity is not detected in 
conventional screening assays for ligand binding or biological activity. 

25 Alternatively, recombinant hypocretin fusion proteins can be produced by joining 
nucleotides encoding additional amino acid residue sequence in proper reading 
frame at the 3' end of the hypocretin sequence. The fusion protein thus produced 
exhibits properties of the added amino acid sequence in addition to the properties 
of hypocretin. For example, the additional amino acid sequence may serve to 

30 help identify and purity the recombinantly produced hypocretin fusion protein. 
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One preferred hypocretin fusion protein is hypocretin-poly(His). 

Preferably, a hypocretin protein of this invention is present in a 
composition in an isolated form, i.e., comprising at least about 0.1 percent by 
weight of the total composition, preferably at least 1%, and more preferably at 
least about 90%. Particularly preferred is a substantially pure preparation of 
hypocretin, that is at least 90% by weight, and more preferably at least 99% by 
weight. Biochemical methods useful for the enrichment and preparation of an 
isolated hypocretin based on the chemical properties of a polypeptide are well 
known, and can be routinely used for the production of proteins which are 
enriched by greater than 99% by weight. 

An isolated or recombinant hypocretin protein of this invention can 
be used for a variety of purposes, as described further herein. A hypocretin 
protein can be used as an immunogen to produce antibodies immunoreactive with 
hypocretin. Hypocretin proteins can be used in in vitro ligand binding assays for 
identifying ligand binding specificities, and agonists or antagonists thereto, to 
characterize candidate pharmaceutical compounds useful for modulating 
hypocretin function, and as therapeutic agents for effecting hypocretin functions. 
Other uses will be readily apparent to one skilled in the art. 

Furthermore, the invention includes analogs of a hypocretin protein 
of this invention. An analog is a man-made variant which exhibits the qualities 
of a hypocretin of this invention in terms of immunological reactivity, ligand 
binding capacity or the like functional properties of a hypocretin protein of this 
invention. An analog can therefore be a cleavage product of hypocretin, can be a 
polypeptide corresponding to a portion of hypocretin, can be hypocretin 
polypeptide in which a membrane anchor has been removed, and can be a variant 
hypocretin sequence in which some amino acid residues have been altered, to 
name a few alternatives. 

Insofar as the present disclosure identifies hypocretin from 
different mammalian species, the present invention is not to be limited to a 
hypocretin protein derived from one or a few mammalian species. Thus, the 
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invention includes a mammalian hypocretin protein, which can be derived, by 
recombinant DNA or biochemical purification from natural sources, from any of 
a variety of species including man, mouse, rabbit, rat, dog, cat, sheep, cow, and 
the like mammalian species, without limitation. Human and agriculturally 
5 relevant animal species are particularly preferred. 

Exemplary hypocretin species identified herein are rat and mouse 

hypocretin. 

The amino acid reside sequence of rat preprohypocretin is shown 
in SEQ ID NO 1 , and corresponding nucleotide (cDNA) of rat preprohypocretin 
10 is shown in SEQ ID NO 3. 

The amino acid residue sequence of mouse preprohypocretin is 
shown in SEQ ID NO 2, and corresponding nucleotide (cDNA) of mouse 
preprohypocretin is shown in SEQ ID NO 4. 

A hypocretin protein of this invention can be prepared by a variety 
15 of means, although expression in a mammalian cell using a recombinant DNA 

expression vector is preferred. Exemplary production methods for a recombinant 
hypocretin are described in the Examples. 

The invention also provides a method for the production of isolated 
hypocretin proteins, either as intact hypocretin protein, as fusion proteins or as 
20 smaller polypeptide fragments of hypocretin. The production method generally 
involves inducing cells to express a hypocretin protein of this invention, 
recovering the hypocretin from the resulting cells, and purifying the hypocretin so 
recovered by biochemical fractionation methods, using a specific antibody of this 
invention, or other chemical procedures. 
25 The inducing step can comprise inserting a recombinant DNA 

vector encoding a hypocretin protein, or fragment thereof, of this invention, 
which recombinant DNA is capable of expressing a hypocretin, into a suitable 
host cell, and expressing the vector's hypocretin gene. 

As used herein, the phrase "hypocretin polypeptide" refers to a 
30 polypeptide having an amino acid residue sequence that comprises an amino acid 
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residue sequence that corresponds, and preferably is identical, to a portion of a 
hypocretin of this invention. 

A hypocretin polypeptide of this invention is characterized by its 
ability to immunologically mimic an epitope (antigenic determinant) expressed by 
a hypocretin of this invention. Such a polypeptide is useful herein as a 
component in an inoculum for producing antibodies that immunoreact with native 
hypocretin and as an antigen in immunologic methods. Representative and 
preferred hypocretin polypeptides for use as an immunogen in an inoculum are 
shown herein. 

As used herein, the phrase "immunologically mimic" in its various 
grammatical forms refers to the ability of a hypocretin polypeptide of this 
invention to immunoreact with an antibody of the present invention that 
recognizes a conserved native epitope of a hypocretin as defined herein. 

It should be understood that a subject polypeptide need not be 
identical to the amino acid residue sequence of a hypocretin receptor, so long as 
it includes the required sequence. 

In addition, certain hypocretin polypeptides derived from receptor 
binding portions of hypocretin have the capacity to inhibit the binding of the 
hypocretin that would normally bind a hypocretin receptor. Thus, the invention 
also includes hypocretin polypeptides which are specifically designed for their 
capacity to mimic exposed regions of hypocretin involved in hypocretin receptor 
binding interactions and thereby receptor function. Therefore, these polypeptides 
have the capacity to function as analogs to hypocretin, and thereby block 
function. 

In addition, polypeptides corresponding to exposed domains have 
the ability to induce antibody molecules that immunoreact with a hypocretin of 
this invention at portions of hypocretin involved in receptor protein function, and 
therefor the antibodies are also useful at modulating normal hypocretin function. 

A hypocretin polypeptide is preferably no more than about 120 
amino acid residues in length for reasons of ease of synthesis. Thus, it more 
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preferred that a hypocretin polypeptide be no more that about 100 amino acid 
residues, still more preferably no more than about 50 residues, and optimally less 
than 40 amino acid residues in length when synthetic methods of production are 
used. Exemplary polypeptides are hcrtl and hcrt2. 
5 The present invention also includes a hypocretin polypeptide that 

has an amino acid residue sequence that corresponds to the sequence of the 
hypocretin protein shown in the sequence listings, and includes an amino acid 
residue sequence represented by a formula selected from the group consisting of 
the polypeptides shown in the sequence listings. In this embodiment, the 

10 polypeptide is further characterized as having the ability to mimic a hypocretin 
epitope and thereby inhibits hypocretin function in a classic hypocretin receptor 
activation assay, as described herein. 

Due to the three dimensional structure of a native folded 
hypocretin molecule, the present invention includes that multiple regions of 

15 hypocretin are involved in hypocretin receptor function, which multiple and 
various regions are defined by the various hypocretin polypeptides described 
above. A preferred hypocretin receptor ligand is hcrt. The ability of the above- 
described polypeptides to inhibit receptor-ligand binding can readily be measured 
in a ligand binding assay as is shown in the Examples herein. Similarly, the 

20 ability of the above-described polypeptides to inhibit hypocretin receptor function 
can readily be measured in a receptor assay as is described herein. 

In another embodiment, the invention includes hypocretin 
polypeptide compositions that comprise one or more of the different hypocretin 
polypeptides described above which inhibit hypocretin receptor function, admixed 

25 in combinations to provide simultaneous inhibition of multiple contact sites on the 
hypocretin receptor. 

A subject polypeptide includes any analog, fragment or chemical 
derivative of a polypeptide whose amino acid residue sequence is shown herein so 
long as the polypeptide is capable of mimicking an epitope of hypocretin. 

30 Therefore, a present polypeptide can be subject to various changes, substitutions, 
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insertions, and deletions where such changes provide for certain advantages in its 
use. In this regard, a hypocretin polypeptide of this invention corresponds to, 
rather than is identical to, the sequence of a hypocretin protein where one or 
more changes are made and it retains the ability to induce antibodies that 
immunoreact with a hypocretin of this invention. 

The term "analog" includes any polypeptide having an amino acid 
residue sequence substantially identical to a sequence specifically shown herein in 
which one or more residues have been conservatively substituted with a 
functionally similar residue and which displays the ability to induce antibody 
production as described herein. Examples of conservative substitutions include 
the substitution of one non-polar (hydrophobic) residue such as isoleucine, valine, 
leucine or methionine for another, the substitution of one polar (hydrophilic) 
residue for another such as between arginine and lysine, between glutamine and 
asparagine, between glycine and serine, the substitution of one basic residue such 
as lysine, arginine or histidine for another, or the substitution of one acidic 
residue, such as aspartic acid or glutamic acid for another. 

The phrase "conservative substitution" also includes the use of a 
chemically derivatized residue in place of a non-derivatized residue provided that 
such polypeptide displays the requisite binding activity. 

"Chemical derivative" refers to a subject polypeptide having one or 
more residues chemically derivatized by reaction of a functional side group. 
Such derivatized molecules include for example, those molecules in which free 
amino groups have been derivatized to form amine hydrochlorides, p-toluene 
sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl groups, chloroacetyl 
groups or formyl groups. Free carboxyl groups may be derivatized to form salts, 
methyl and ethyl esters or other types of esters or hydrazides. Free hydroxy 1 
groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole 
nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also 
included as chemical derivatives are those peptides which contain one or more 
naturally occurring amino acid derivatives of the twenty standard amino acids. 
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For examples: 4-hydroxyproline may be substituted for proline; 5-hydroxylysine 
may be substituted for lysine; 3-methylhistidine may be substituted for histidine; 
homoserine may be substituted for serine; and ornithine may be substituted for 
lysine. D-amino acids may also be included in place of one or more L-amino 
5 acids. Polypeptides of the present invention also include any polypeptide having 
one or more additions and deletions or residues relative to the sequence of a 
polypeptide whose sequence is shown herein, so long as the requisite activity is 
maintained. 

The term "fragment" refers to any subject polypeptide having an 

10 amino acid residue sequence shorter than that of a polypeptide whose amino acid 
residue sequence is shown herein. 

When a polypeptide of the present invention has a sequence that is 
not identical to the sequence of a hypocretin polypeptide, it is typically because 
one or more conservative or non-conservative substitutions have been made, 

15 usually no more than about 30 number percent, more usually no more than 20 
number percent, and preferably no more than 10 number percent of the amino 
acid residues are substituted. Additional residues may also be added at either 
terminus for the purpose of providing a "linker" by which the polypeptides of this 
invention can be conveniently affixed to a label or solid matrix, or carrier. 

20 Preferably the linker residues do not form a hypocretin epitope, i.e., are not 
similar is structure to a hypocretin protein. 

Labels, solid matrices and carriers that can be used with the 
polypeptides of this invention are described hereinbelow. 

Amino acid residue linkers are usually at least one residue and can 

25 be 40 or more residues, more often 1 to 10 residues, but do not form a 

hypocretin epitope. Typical amino acid residues used for linking are tyrosine, 
cysteine, lysine, glutamic and aspartic acid, or the like. In addition, a subject 
polypeptide can differ, unless otherwise specified, from the natural sequence of a 
hypocretin protein by the sequence being modified by terminal-NH 2 acylation, 

30 e.g., acetylation, or thioglycolic acid amidation, by terminal-carboxlyamidation, 
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e.g., with ammonia, methylamine, and the like. 

When coupled to a carrier to form what is known in the art as a 
carrier-hapten conjugate, a hypocretin polypeptide of the present invention is 
capable of inducing antibodies that immunoreact with hypocretin. In view of the 
well established principle of immunologic cross-reactivity, the present invention 
therefore includes antigenically related variants of the polypeptides shown herein. 
An "antigenically related variant" is a subject polypeptide that is capable of 
inducing antibody molecules that immunoreact with a polypeptide described 
herein and with a hypocretin protein of this invention. 

Any peptide of the present invention may be used in the form of a 
pharmaceutical^ acceptable salt. Suitable acids which are capable of forming 
salts with the peptides of the present invention include inorganic acids such as 
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, 
sulfuric acid, phosphoric acetic acid, propionic acid, glycolic acid, lactic acid, 
pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, fumaric acid, 
anthranilic acid, cinnamic acid, naphthalene sulfonic acid, sulfanilic acid or the 
like. 

Suitable bases capable of forming salts with the peptides of the 
present invention include inorganic bases such as sodium hydroxide, ammonium 
hydroxide, potassium hydroxide and the like; and organic bases such as mono-, 
di- and tri-alkyl and aryl amines (e.g. triethylamine, diisopropyl amine, methyl 
amine, dimethyl amine and the like) and optionally substituted ethanolamines 
(e.g. ethanolamine, diethanolamine and the like). 

A hypocretin polypeptide of the present invention, also referred to 
herein as a subject polypeptide, can be synthesized by any of the techniques that 
are known to those skilled in the polypeptide art, including recombinant DNA 
techniques. Synthetic chemistry techniques, such as a solid-phase Merrifield-type 
synthesis, are preferred for reasons of purity, antigenic specificity, freedom from 
undesired side products, ease of production and the like. An excellent summary 
of the many techniques available can be found in J.M. Steward and J.D. Young, 
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"Solid Phase Peptide Synthesis", W.H. Freeman Co., San Francisco, 1969; ML 
Bodansky, et al., "Peptide Synthesis", John Wiley & Sons, Second Edition, 1976 
and J. Meienhofer, "Hormonal Proteins and Peptides", Vol. 2, p. 46, Academic 
Press (New York), 1983 for solid phase peptide synthesis, and E. Schroder and 
K. Kubke, "The Peptides", Vol. 1, Academic Press (New York), 1965 for 
classical solution synthesis, each of which is incorporated herein by reference. 
Additional peptide synthesis methods are described by Sutcliffe in U.S. Patent 
No. 4,900,811 and 5,242,798, which are hereby incorporated by reference. 
Appropriate protective groups usable in such synthesis are described in the above 
texts and in J.F.W. McOmie, "Protective Groups in Organic Chemistry", Plenum 
Press, New York, 1973, which is incorporated herein by reference. 

In general, the solid-phase synthesis methods comprise the 
sequential addition of one or more amino acid residues or suitably protected 
amino acid residues to a growing peptide chain. Normally, either the amino or 
carboxyl group of the first amino acid residue is protected by a suitable, 
selectively removable protecting group. A different, selectively removable 
protecting group is utilized for amino acids containing a reactive side group such 
as lysine. 

Using a solid phase synthesis as exemplary, the protected or 
derivatized amino acid is attached to an inert solid support through its unprotected 
carboxyl or amino group. The protecting group of the amino or carboxyl group 
is then selectively removed and the next amino acid in the sequence having the 
complimentary (amino or carboxyl) group suitably protected is admixed and 
reacted under conditions suitable for forming the amide linkage with the residue 
already attached to the solid support. The protecting group of the amino or 
carboxyl group is then removed from this newly added amino acid residue, and 
the next amino acid (suitably protected) is then added, and so forth. After all the 
desired amino acids have been linked in the proper sequence, any remaining 
terminal and side group protecting groups (and solid support) are removed 
sequentially or concurrently, to afford the final polypeptide. 
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A hypocretin polypeptide can be used, inter alia , in the diagnostic 
methods and systems of the present invention to detect a hypocretin receptor or 
hypocretin itself present in a body sample, or can be used to prepare an inoculum 
as described herein for the preparation of antibodies that immunoreact with 
conserved epitopes on hypocretin. 

In addition, certain of the hypocretin polypeptides of this invention 
can be used in the therapeutic methods of the present invention to inhibit 
hypocretin function as described further herein. 

C. Nucleic Acids and Polynucleotides 

The DNA segments of the present invention are characterized as 
including a DNA sequence that encodes a hypocretin protein of this invention. 
That is, the DNA segments of the present invention are characterized by the 
presence of some or all of a hypocretin structural gene. Preferably the gene is 
present as an uninterrupted linear series of codons where each codon codes for an 
amino acid residue found in the hypocretin protein, i.e., a gene free of introns. 

One preferred embodiment is a DNA segment that codes an amino 
acid residue sequence that defines a hypocretin protein as defined herein, and the 
DNA segment is capable of expressing a hypocretin protein of this invention. A 
preferred DNA segment codes for an amino acid residue sequence substantially 
the same as, and preferably consisting essentially of, an amino acid residue 
sequence shown in the sequence listing for a hypocretin protein, such as in SEQ 
ID NOs 1 and 2. 

The amino acid residue sequence of a protein or polypeptide is 
directly related via the genetic code to the deoxyribonucleic acid (DNA) sequence 
of the structural gene that codes for the protein. Thus, a structural gene or DNA 
segment can be defined in terms of the amino acid residue sequence, i.e., protein 
or polypeptide, for which it codes. 

An important and well known feature of the genetic code is its 
degeneracy. That is, for most of the amino acids used to make proteins, more 
than one coding nucleotide triplet (codon) can code for or designate a particular 
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amino acid residue. Therefore, a number of different nucleotide sequences may 
code for a particular amino acid residue sequence. Such nucleotide sequences are 
considered functionally equivalent since they can result in the production of the 
same amino acid residue sequence in all organisms. Occasionally, a methylated 
5 variant of a purine or pyrimidine may be incorporated into a given nucleotide 
sequence. However, such methylations do not affect the coding relationship in 
any way. 

A nucleic acid is any polynucleotide or nucleic acid fragment, 
whether it be a polyribonucleotide of polydeoxyribonucleotide, i.e., RNA or 
10 DNA, or analogs thereof. In preferred embodiments, a nucleic acid molecule is 
in the form of a segment of duplex DNA, i.e, a DNA segment, although for 
certain molecular biological methodologies, single-stranded DNA or RNA is 
preferred. 

DNA segments (i.e., synthetic oligonucleotides) that encode 
15 portions of hypocretin proteins can easily be synthesized by chemical techniques, 
for example, the phosphotriester method of Matteucci, et al., (J. Am. Chem. 
Soc , 103:3185-3191, 1981) or using automated synthesis methods. In addition, 
larger DNA segments can readily be prepared by well known methods, such as 
synthesis of a group of oligonucleotides that define the DNA segment, followed 
20 by hybridization and ligation of oligonucleotides to build the complete segment. 

Of course, by chemically synthesizing the coding sequence, any 
desired modifications can be made simply by substituting the appropriate bases 
for those encoding the native amino acid residue sequence. 

Furthermore, DNA segments consisting essentially of structural 
25 genes encoding a hypocretin protein can be obtained from recombinant DNA 
molecules containing a gene that defines a hypocretin protein of this invention, 
and can be subsequently modified, as by site directed mutagenesis, to introduce 
any desired substitutions. 

1. Cloning Hypocretin Genes 
30 Hypocretin genes of this invention can be cloned by a variety of 
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cloning methods and from any mammalian species. The cloning is based on the 
observation that there is a significant degree of homology between mammalian 
species for any given hypocretin of this invention, and therefor can be conducted 
according to the general methods described in the Examples, using nucleic acid 
homology strategies. 

A typical degree of homology required to successfully clone a 
hypocretin is at least about 80% homologous at the DNA level, and at least about 
90% homologous at the protein level. Preferred cloning strategies for isolating a 
nucleic acid molecule that encodes a hypocretin molecule of this invention are 
described in the Examples, and includes the recitation of polynucleotide probes 
useful for the screening of libraries of nucleic acid molecules believed to contain 
a target hypocretin gene. 

Sources of libraries for cloning a hypocretin gene of this invention 
can include genomic DNA or messenger RNA (mRNA) in the form of a cDNA 
library from a tissue believed to express a hypocretin of this invention. Preferred 
tissues are brain tissues, particularly hypothalamic tissue. The similarities 
between rat and mouse hypocretin are further extended to the identification of a 
sequence of iteration of trinucleotide CTG repeats. For both mammals, a 
sequence of four iterations of the trinucleotide CTG repeats followed by two pairs 
of CTG are present encoding leucine residues. Thus, the presence of the 
iterations is typically located within the coding region for the signal peptide. 

Such a triplet expansion in other genes has been implicated as 
causal in neurological diseases, e.g., myotonic dystrophy as described by Brook 
et al., CeU, 68:799-808 (1992) and fragile-X syndrome as described by Fu et al., 
CeU, 67:1047-1058 (1991). In myotonic dystrophy patients who are mildly 
affected, at least 50 CTG repeats are present. In severely affected individuals, 
the expansion can exist up to several kilobase pairs. In contrast, in the normal 
population, the repeat sequence is highly variable ranging from 5 to 27 copies. 
Individuals with varying severities of fragile-X have been similarly characterized. 
Screening for the presence of a region of DNA in which the 
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repeats are present in either normal, underexpansion or overexpansion form can 
provide a genetic basis for diagnosis for some diseases. The same may be true 
for hypocretin in that expansion of the region may contribute to the basis for a 
neuronal disorder or disease of the brain or other tissue. 
5 2. Oligonucleotides 

The invention also includes oligonucleotides useful for methods to 
detect the presence of a hypocretin gene or gene transcript (mRNA) in a tissue by 
diagnostic detection methods based on the specificity of nucleic acid hybridization 
or primer extension reactions. One embodiment includes any polynucleotide 

10 probe having a sequence of a portion of a hypocretin gene of this invention, or a 
related and specific sequence. Hybridization probes can be of a variety of lengths 
from about 10 to 5000 nucleotides long, although they will typically be about 20 
to 500 nucleotides in length. Hybridization methods are extremely well known in 
the art and will not be described further here. 

15 In a related embodiment, detection of hypocretin genes can be 

conducted by primer extension reactions such as the polymerase chain reaction 
(PCR), To that end, PCR primers are utilized in pairs, as is well known, based 
on the nucleotide sequence of the gene to be detected. Particularly preferred 
PCR primers can be derived from any portion of a hypocretin DNA sequence, 

20 but are preferentially from regions which are not conserved in other cellular 
proteins. 

A preferred PCR primer pair useful for detecting hypocretin genes 
and hypocretin gene expression are described in the Examples. Nucleotide 
primers from the corresponding region of hypocretin described herein are readily 
25 prepared and used as PCR primers for detection of the presence or expression of 
the corresponding gene in any of a variety of tissues. 

3. Expression Vectors 

In addition, the invention includes a recombinant DNA molecule 
(recombinant DNA) containing a DNA segment of this invention encoding a 
30 hypocretin protein as described herein. A recombinant DNA can be produced by 
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operatively linking a vector to a DNA segment of the present invention. 

The choice of vector to which a DNA segment of the present 
invention is operatively linked depends directly, as is well known in the art, on 
the functional properties desired, e.g., protein expression, and the host cell to be 
transformed, these being limitations inherent in the art of constructing 
recombinant DNA molecules. However, a vector of the present invention is at 
least capable of directing the replication, and preferably also expression, of a 
hypocretin structural gene included in DNA segments to which it is operatively 
linked. 

In one embodiment, a vector of the present invention includes a 
procaryotic replicon, i.e., a DNA sequence having the ability to direct 
autonomous replication and maintenance of the recombinant DNA molecule 
extrachromosomally in a procaryotic host cell, such as a bacterial host cell, 
transformed therewith. Such replicons are well known in the art. In addition, 
those embodiments that include a procaryotic replicon also include a gene whose 
expression confers drug resistance to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that confer resistance to 
ampicillin or tetracycline. 

Those vectors that include a procaryotic replicon can also include a 
procaryotic promoter capable of directing the expression (transcription and 
translation) of a hypocretin gene in a bacterial host cell, such as E. coli, 
transformed therewith. A promoter is an expression control element formed by a 
DNA sequence that permits binding of RNA polymerase and transcription to 
occur. Promoter sequences compatible with bacterial hosts are typically provided 
in plasmid vectors containing convenient restriction sites for insertion of a DNA 
segment of the present invention. Typical of such vector plasmids are pUC8, 
pUC9, pBR322 and pBR329 available from Biorad Laboratories, (Richmond, 
CA), pRSET available from Invitrogen (San Diego, CA) and pPL and pKK223 
available from Pharmacia, Piscataway, N.J. 

Expression vectors compatible with eucaryotic cells, preferably 
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those compatible with vertebrate cells, can also be used to form the recombinant 
DNA molecules of the present invention. Eucaryotic cell expression vectors are 
well known in the art and are available from several commercial sources. 
Typically, such vectors are provided containing convenient restriction sites for 
5 insertion of the desired DNA segment. Typical of such vectors are pSVL and 
pKSV-10 (Pharmacia), pBPV-l/pML2d (International Biotechnologies, Inc.), 
pTDTl (ATCC, #31255), pRc/CMV (Invitrogen, Inc.), the vector pCMV4 
described herein, and the like eucaryotic expression vectors. 

In preferred embodiments, the eucaryotic cell expression vectors 

10 used to construct the recombinant DNA molecules of the present invention 

include a selection marker that is effective in an eucaryotic cell, preferably a drug 
resistance selection marker. A preferred drug resistance marker is the gene 
whose expression results in neomycin resistance, i.e., the neomycin 
phosphotransferase (neo) gene. Southern et al., J. Mol. Appl. Genet. . 1:327- 

15 341 (1982). Alternatively, the selectable marker can be present on a separate 
plasmid, and the two vectors are introduced by co-transfection of the host cell, 
and selected by culturing in the appropriate drug for the selectable marker. 
4. Inhibitory Nucleic Acids 

In accordance with one embodiment of the invention, nucleic acid 
20 molecules can be used in methodologies for the inhibition of hypocretin gene 

expression, thereby inhibiting the function of the hypocretin: hypocretin receptor 
binding interaction by blocking hypocretin expression. 

To that end, the invention includes isolated nucleic acid molecules, 
preferably single-stranded nucleic acid molecules (oligonucleotides), having a 
25 sequence complementary to a portion of a structural gene encoding a hypocretin 
protein of this invention. Nucleic acid-based inhibition is well known and 
generally referred to as "anti-sense" technology by virtue of the use of nucleotide 
sequences having complementarily which can hybridize to the "sense" strand or 
mRNA, and thereby perturb gene expression. Typical oligonucleotides for 
30 this purpose are about 10 to 5,000, preferably about 20-1000, nucleotides in 
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length and have a sequence capable of hybridizing specifically with a structural 
protein region of the nucleotide sequence that encodes a hypocretin protein of this 
invention. 

In one embodiment, the invention includes repetitive units of the 
nucleotide sequence complementary to a portion of a hypocretin structural gene so 
as to present multiple sites for complementary binding to the structural gene. 
This feature may be provided in a single nucleic acid segment having repeating 
sequences defining multiple portions of a structural gene, by physical conjugation 
of DNA segments each containing a single portion of a structural gene, or a 
combination thereof comprising conjugates of DNA segments, each having one or 
more sequences complementary to a structural gene. 

Nucleotide base modifications can be made to provide certain 
advantages to a DNA segments of this invention, referred to as nucleotide 
analogs. A nucleotide analog refers to moieties which function similarly to 
nucleotide sequences in a nucleic acid molecule of this invention but which have 
non-naturally occurring portions. Thus, nucleotide analogs can have altered sugar 
moieties or inter-sugar linkages. Exemplary are the phosphorothioate and other 
sulfur-containing species, analogs having altered base units, or other 
modifications consistent with the spirit of this invention. 

Preferred modifications include, but are not limited to, the ethyl or 
methyl phosphorate modifications disclosed in U.S. Patent No. 4,469,863 and the 
phosphorothioate modified deoxyribonucleotides described by LaPlanche et al., 
Nucl. Acids Res. . 14:9081, 1986; and Stec et al., J Am. Chem. Soc. 106:6077, 
1984. These modifications provide resistance to nucleolytic degradation, thereby 
contributing to the increased half-life in therapeutic modalities. Preferred 
modifications are the modifications of the 3' -terminus using phosphothioate (PS) 
sulfurization modification described by Stein et al., Nucl. Acids Res. , 16:3209, 
1988. 

In accordance with the methods of this invention in certain 
preferred embodiments, at least some of the phosphodiester bonds of the 
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nucleotide sequence can be substituted with a structure which functions to 
enhance the ability of the compositions to penetrate into the region of cells where 
the hypocretin structural gene to be inhibited is located. It is preferred that such 
linkages be sulfur containing as discussed above, such as phosphorotioate bonds. 
5 Other substitutions can include alkyl phosphothioate bonds, N-alkyl 

phosphoramidates, phosphorodithioates, alkyl phosphonates, and short chain alkyl 
or cycloalkyl structures. In accordance with other preferred embodiments, the 
phosphodiester bonds are substituted with structures which are, at once, 
substantially non-ionic and non-chiral. 

10 D. Anti-Hvpocretin Antibodies 

An antibody of the present invention, i.e., an anti-hypocretin 
antibody, in one embodiment is characterized as comprising antibody molecules 
that immunoreact with a hypocretin protein of this invention. Preferably, an 
antibody further immunoreacts with a hypocretin protein in situ , i.e., in a tissue 

15 section. 

The invention describes an anti-hypocretin antibody that 
immunoreacts with any of the hypocretin polypeptides of this invention, 
preferably also immunoreacts with the corresponding recombinant hypocretin 
protein, and more preferably also reacts with a native protein in situ in a tissue 

20 section. Preferably, the antibody is substantially free from immunoreaction with 
other proteins or neuropeptides other than hypocretin. Assays for 
immunoreaction useful for assessing immunoreactivity are described herein. 

In one embodiment, antibody molecules are described that 
immunoreact with a hypocretin receptor polypeptide of the present invention and 

25 that have the capacity to immunoreact with an exposed site on hypocretin that is 
required for hypocretin receptor binding. Thus, preferred antibody molecules in 
this embodiment also inhibit hypocretin receptor function, and are therefore 
useful therapeutically to block the receptor's function. 

Exemplary hypocretin inhibitory antibodies immunoreact with a 

30 hypocretin polypeptide described herein that defines an exposed region of a 
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hypocretin protein that is involved in hypocretin receptor function, such as ligand 
binding. 

An antibody of the present invention is typically produced by 
immunizing a mammal with an inoculum containing a hypocretin polypeptide of 
this invention and thereby induce in the mammal antibody molecules having 
immunospecificity for the immunizing polypeptide. The antibody molecules are 
then collected from the mammal and isolated to the extent desired by well known 
techniques such as, for example, by using DEAE Sephadex to obtain the IgG 
fraction. Exemplary antibody preparation methods using hypocretin polypeptides 
in the immunogen are described herein in the Examples. 

The preparation of antibodies against polypeptide is well known in 
the art. See Staudt et al., J. Exp. Med. , 157:687-704 (1983), or the teachings of 
Sutcliffe, J.G., as described in United States Patent No. 4,900,811, the teachings 
of which are hereby incorporated by reference. 

Briefly, to produce a hypocretin peptide antibody composition of 
this invention, a laboratory mammal is inoculated with an immunologically 
effective amount of a hypocretin polypeptide, typically as present in a vaccine of 
the present invention. The anti-hypocretin antibody molecules thereby induced 
are then collected from the mammal as an antiserum and those immunospecific 
for both a hypocretin polypeptide and the corresponding recombinant hypocretin 
protein are isolated to the extent desired by well known techniques such as, for 
example, by immunoaffinity chromatography. Alternatively, the antiserum may 
be used. 

To enhance the specificity of the antibody, the antibodies are 
preferably purified by immunoaffinity chromatography using solid phase-affixed 
immunizing polypeptide. The antibody is contacted with the solid phase-affixed 
immunizing polypeptide for a period of time sufficient for the polypeptide to 
immunoreact with the antibody molecules to form a solid phase-affixed 
immunocomplex. The bound antibodies are separated from the complex by 
standard techniques. 
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The word "inoculum" in its various grammatical forms is used 
herein to describe a composition containing a hypocretin polypeptide of this 
invention as an active ingredient used for the preparation of antibodies against a 
hypocretin polypeptide. When a polypeptide is used in an inoculum to induce 
5 antibodies it is to be understood that the polypeptide can be used in various 
embodiments, e.g., alone or linked to a carrier as a conjugate, or as a 
polypeptide polymer. However, for ease of expression and in context of a 
polypeptide inoculum, the various embodiments of the polypeptides of this 
invention are collectively referred to herein by the term "polypeptide" and its 

10 various grammatical forms. 

For a polypeptide that contains fewer than about 35 amino acid 
residues, it is preferable to use the peptide bound to a carrier for the purpose of 
inducing the production of antibodies. 

One or more additional amino acid residues can be added to the 

15 amino- or carboxy-termini of the polypeptide to assist in binding the polypeptide 
to a carrier. Cysteine residues added at the amino- or carboxy-termini of the 
polypeptide have been found to be particularly useful for forming conjugates via 
disulfide bonds. However, other methods well known in the art for preparing 
conjugates can also be used. 

20 The techniques of polypeptide conjugation or coupling through 

activated functional groups presently known in the art are particularly applicable. 
See, for example, Aurameas, et al., Scand. J. Immunol. , Vol. 8, Suppl. 7:7-23 
(1978) and U.S. Patent No. 4,493,795, No. 3,791,932 and No. 3,839,153. In 
addition, a site-directed coupling reaction can be carried out so that any loss of 

25 activity due to polypeptide orientation after coupling can be minimized. See, for 
example, Rodwell et al., Biotech. , 3:889-894 (1985), and U.S. Patent No. 
4,671,958. 

Exemplary additional linking procedures include the use of Michael 
addition reaction products, di-aldehydes such as glutaraldehyde, Klipstein, et al., 
30 J. Infect. Pis. . 147:318-326 (1983) and the like, or the use of carbodiimide 
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technology as in the use of a water-soluble carbodiimide to form amide links to 
the carrier. Alternatively, the heterobifunctional cross-linker SPDP (N- 
succinimidyl-3-(2-pyridyldithio) proprionate)) can be used to conjugate peptides, 
in which a carboxy -terminal cysteine has been introduced. 

Useful carriers are well known in the art, and are generally 
proteins themselves. Exemplary of such carriers are keyhole limpet hemocyanin 
(KLH), edestin, thyroglobulin, albumins such as bovine serum albumin (BSA) or 
human serum albumin (HSA), red blood cells such as sheep erythrocytes (SRBC), 
tetanus toxoid, cholera toxoid as well as poly amino acids such as poly D- 
lysine :D-glutamic acid, and the like. 

The choice of carrier is more dependent upon the ultimate use of 
the inoculum and is based upon criteria not particularly involved in the present 
invention. For example, a carrier that does not generate an untoward reaction in 
the particular animal to be inoculated should be selected. 

The present inoculum contains an effective, immunogenic amount 
of a polypeptide of this invention, typically as a conjugate linked to a carrier. 
The effective amount of polypeptide per unit dose sufficient to induce an immune 
response to the immunizing polypeptide depends, among other things, on the 
species of animal inoculated, the body weight of the animal and the chosen 
inoculation regimen is well known in the art. Inocula typically contain 
polypeptide concentrations of about 10 micrograms (jig) to about 500 milligrams 
(mg) per inoculation (dose), preferably about 50 micrograms to about 50 
milligrams per dose. 

The term "unit dose" as it pertains to the inocula refers to 
physically discrete units suitable as unitary dosages for animals, each unit 
containing a predetermined quantity of active material calculated to produce the 
desired immunogenic effect in association with the required diluent; i.e., carrier, 
or vehicle. The specifications for the novel unit dose of an inoculum of this 
invention are dictated by and are directly dependent on (a) the unique 
characteristics of the active material and the particular immunologic effect to be 
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achieved, and (b) the limitations inherent in the art of compounding such active 
material for immunologic use in animals, as disclosed in detail herein, these being 
features of the present invention. 

Inocula are typically prepared from the dried solid polypeptide- 
conjugate by dispersing the poly pep tide-conjugate in a physiologically tolerable 
(acceptable) diluent such as water, saline or phosphate-buffered saline to form an 
aqueous composition. 

Inocula can also include an adjuvant as part of the diluent. 
Adjuvants such as complete Freund's adjuvant (CFA), incomplete Freund's 
adjuvant (IFA) and alum are materials well known in the art, and are available 
commercially from several sources. 

The antibody so produced can be used, inter aha, in the diagnostic 
methods and systems of the present invention to detect hypocretin present in a 
sample such as a tissue section or body fluid sample. Anti-hypocretin antibodies 
that inhibit hypocretin function can also be used in vivo in therapeutic methods as 

described herein. 

A preferred anti-hypocretin antibody is a monoclonal antibody. A 
preferred monoclonal antibody of this invention comprises antibody molecules 
that immunoreact with a hypocretin polypeptide of the present invention as 
described for the anti-hypocretin antibodies of this invention. More preferably, 
the monoclonal antibody also immunoreacts with recombinantly produced whole 

hypocretin protein. 

A monoclonal antibody is typically composed of antibodies 
produced by clones of a single cell called a hybridoma that secretes (produces) 
only one kind of antibody molecule. The hybridoma cell is formed by fusing an 
antibody-producing cell and a myeloma or other self-perpetuating cell line. The 
preparation of such antibodies was first described by Kohler and Milstein, 
Nature , 256:495-497 (1975), the description of which is incorporated by 
reference. The hybridoma supernates so prepared can be screened for the 
presence of antibody molecules that immunoreact with a hypocretin polypeptide, 
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or for inhibition of hypocretin binding to hypocretin receptor as described herein. 

Briefly, to form the hybridoma from which the monoclonal 
antibody composition is produced, a myeloma or other self-perpetuating cell line 
is fused with lymphocytes obtained from the spleen of a mammal 
5 hyperimmunized with a hypocretin antigen, such as is present in a hypocretin 
polypeptide of this invention. The polypeptide-induced hybridoma technology is 
described by Niman et al., Proc, Natl. Acad. Sci.. USA . 80:4949-4953 (1983), 
the description of which is incorporated herein by reference. 

It is preferred that the myeloma cell line used to prepare a 

10 hybridoma be from the same species as the lymphocytes. Typically, a mouse of 
the strain 129 G1X + is the preferred mammal. Suitable mouse myelomas for use 
in the present invention include the hypoxanthine-aminopterin-thymidine-sensitive 
(HAT) cell lines P3X63-Ag8.653, and Sp2/0-Agl4 that are available from the 
American Type Culture Collection, Rockville, MD, under the designations CRL 

15 1580 and CRL 1581, respectively. 

Splenocytes are typically fused with myeloma cells using 
polyethylene glycol (PEG) 1500. Fused hybrids are selected by their sensitivity 
to HAT. Hybridomas producing a monoclonal antibody of this invention are 
identified using the enzyme linked immunosorbent assay (ELISA) described in the 

20 Examples. 

A monoclonal antibody of the present invention can also be 
produced by initiating a monoclonal hybridoma culture comprising a nutrient 
medium containing a hybridoma that produces and secretes antibody molecules of 
the appropriate polypeptide specificity. The culture is maintained under 
25 conditions and for a time period sufficient for the hybridoma to secrete the 

antibody molecules into the medium. The antibody-containing medium is then 
collected. The antibody molecules can then be further isolated by well known 
techniques. 

Media useful for the preparation of these compositions are both 
30 well known in the art and commercially available and include synthetic culture 
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media, inbred mice and the like. An exemplary synthetic medium is Dulbecco's 
Minimal Essential Medium (DMEM; Dulbecco et al., Virol. 8:396 (1959)) 
supplemented with 4.5 gm/1 glucose, 20 mM glutamine, and 20% fetal calf 
serum. An exemplary inbred mouse strain is the Balb/c. 

Other methods of producing a monoclonal antibody, a hybridoma 
cell, or a hybridoma cell culture are also well known. See, for example, the 
method of isolating monoclonal antibodies from an immunological repertoire as 
described by Sastry, et al., Proc. Natl. Acad. ScL USA . 86:5728-5732 (1989); 
and Huse et al., Science . 246:1275-1281 (1989). 

The monoclonal antibodies of this invention can be used in the 
same manner as disclosed herein for antibodies of the present invention. 

For example, the monoclonal antibody can be used in the 
therapeutic, diagnostic or in vitro methods disclosed herein where 
immunoreaction with hypocretin is desired. 

Also included in this invention is the hybridoma cell, and cultures 
containing a hybridoma cell that produce a monoclonal antibody of this invention. 

E. Diagnostic Methods 

The present invention includes various assay methods for 
determining the presence, and preferably amount, of hypocretin in a body sample 
such as a tissue sample, including tissue mass or tissue section, or in a biological 
fluid sample using a polypeptide, polyclonal antibody or monoclonal antibody of 
this invention as an immunochemical reagent to form an immunoreaction product 
whose amount relates, either directly or indirectly, to the amount of hypocretin in 
the sample. 

Those skilled in the art will understand that there are numerous 
well known clinical diagnostic chemistry procedures in which an immunochemical 
reagent of this invention can be used to form an immunoreaction product whose 
amount relates to the amount of hypocretin in a body sample. Thus, while 
exemplary assay methods are described herein, the invention is not so limited. 

For example, in view of the demonstrated property that hypocretin 
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binds a hypocretin receptor, a hypocretin protein of this invention can be used 
directly as a probe for detection of a hypocretin receptor by binding thereto. 

Additionally, one can use a nucleic acid molecule probes described 
herein to detect the presence in a cell or tissue of a hypocretin gene or expressed 
5 gene in the form of mRNA encoding a hypocretin protein of this invention, as 
described further herein. Suitable probe-based assays are described by Sutcliffe 
in United States Patent Nos. 4,900,811 and 5,242,798, the disclosures of which 
are incorporated by reference. 

Various heterogenous and homogeneous protocols, either 
10 competitive or noncompetitive, can be employed in performing an assay method 
of this invention. 

For example, one embodiment includes a method for assaying the 
amount of hypocretin protein in a sample that utilizes an anti-hypocretin antibody 
to immunoreact with hypocretin protein in a sample. In this embodiment, the 
15 antibody immunoreacts with hypocretin to form a hypocretin-antibody 

immunoreaction complex, and the complex is detected indicating the presence of 
hypocretin in the sample. 

An immunoassay method using an anti-hypocretin antibody 
molecule for assaying the amount of hypocretin in a sample typically comprises 
20 the steps of: 

(a) Forming an immunoreaction admixture by admixing 
(contacting) a sample with an anti-hypocretin antibody of the present invention, 
preferably a monoclonal antibody. The sample is typically in the form of a fixed 
tissue section in a solid phase such that the immunoreaction admixture has both a 

25 liquid phase and a solid phase, and the antibody functions as a detection reagent 
for the presence of hypocretin in the sample. 

Preferably, the sample is a brain tissue sample that has been 
prepared for immunohistological staining as is well known, although other tissue 
samples may be adsorbed onto a solid phase, including tissue extracts or body 

30 fluid. In that case the adsorption onto a solid phase can be conducted as 
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described for well known Western blot procedures. 

(b) The immunoreaction admixture is maintained under 
biological assay conditions for a predetermined time period such as about 10 
minutes to about 16-20 hours at a temperature of about 4 degree Celsius to about 
5 45 degree Celsius that, such time being sufficient for the hypocretin present in the 
sample to immunoreact with (immunologically bind) the antibody and form a 
hypocretin-containing immunoreaction product (immunocomplex). 

Biological assay conditions are those that maintain the biological 
activity of the immunochemical reagents of this invention and the hypocretin 
10 sought to be assayed. Those conditions include a temperature range of about 4 
degree Celsius to about 45 degree Celsius, a pH value range of about 5 to about 
9 and an ionic strength varying from that of distilled water to that of about one 
molar sodium chloride. Methods for optimizing such conditions are well known 
in the art. 

15 (c) The presence, and preferably amount, of hypocretin- 

containing immunoreaction product that formed in step (b) is determined 
(detected), thereby determining the amount of hypocretin present in the sample. 

Determining the presence or amount of the immunoreaction 
product, either directly or indirectly, can be accomplished by assay techniques 
20 well known in the art, and typically depend on the type of indicating means used. 

Preferably, the determining of step (c) comprises the steps of: 

(i) admixing the hypocretin-containing immunoreaction 
product with a second antibody to form a second (detecting) immunoreaction 
admixture, said second antibody molecule having the capacity to immunoreact 

25 with the first antibody (primary) in the immunoreaction product. 

Antibodies useful as the second antibody include polyclonal or 
monoclonal antibody preparations raised against the primary antibody. 

(ii) maintaining said second immunoreaction admixture 
for a time period sufficient for said second antibody to complex with the 

30 immunoreaction product and form a second immunoreaction product, and 
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(iii) determining the amount of second antibody present in 
the second immunoreaction product and thereby the amount of immunoreaction 
product formed in step (c). 

In one embodiment, the second antibody is a labeled antibody (i.e., 
detecting antibody) such that the label provides an indicating means to detect the 
presence of the second immunoreaction product formed. The label is measured 
in the second immunoreaction product, thereby indicating the presence, and 
preferably amount, of second antibody in the solid phase. 

Alternatively, the amount of second antibody can be determined by 
preparation of an additional reaction admixture having an indicating means that 
specifically reacts with (binds to) the second antibody, as is well known. 
Exemplary are third immunoreaction admixtures with a labeled anti- 
immunoglobulin antibody molecule specific for the second antibody. After third 
immunoreaction, the formed third immunoreaction product is detected through the 
presence of the label. 

Exemplary methods involve the use of in situ immunoreaction 
methods using tissue sections, or Western blot procedures, as described by 
Sutcliffe in United States Patent No. 4,900,811. 

Another embodiment is a method for assaying the amount of 
therapeutically administered hypocretin protein or anti-hypocretin antibody in a 
body fluid sample such as cerebrospinal fluid (CSF), blood, plasma or serum. 
The method utilizes a competition reaction in which either a hypocretin 
polypeptide or an anti-hypocretin antibody molecule of this invention is present in 
the solid phase as an immobilized immunochemical reagent, and the other of the 
two reagents is present in solution in the liquid phase, in the form of a labeled 
reagent. A fluid sample is admixed thereto to form a competition 
immunoreaction admixture, and the resulting amount of label in the solid phase is 
proportional, either directly or indirectly, to the amount of hypocretin polypeptide 
or antibody in the fluid sample, depending upon the format. 

One version of this embodiment comprises the steps of: 
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(a) Forming a competition immunoreaction admixture by 
admixing (contacting) a fluid sample with: 

(1) an anti-hypocretin antibody according to this invention 
containing antibody molecules that immunoreact with a hypocretin protein of this 
invention, said antibody being operatively linked to a solid matrix such that the 
competition immunoreaction admixture has both a liquid phase and a solid phase, 
and 

(2) a polypeptide or recombinant hypocretin protein of the 
present invention that is immunoreactive with the added antibody. The admixed 
polypeptide/protein in the liquid phase (labeled competing antigen) is operatively 
linked to an indicating means as described herein. 

(b) The competition immunoreaction admixture is then 
maintained for a time period sufficient for the competing antigen and the body 
sample antigen present in the liquid phase to compete for immunoreaction with 
the solid phase antibody. Such immunoreaction conditions are previously 
described, and result in the formation of an indicating means-containing 
immunoreaction product comprising the labeled competing antigen in the solid 
phase. 

(c) The amount of indicating means present in the product 
formed in step (b) is then determined, thereby determining the presence, and 
preferably amount, of sample antigen present in the fluid sample. 

Determining the indicating means in the solid phase is then 

conducted by the standard methods described herein. 

A reverse version of this embodiment comprises the steps of: 
(a) Forming a competition immunoreaction admixture by 

admixing a fluid sample with: 

(1) an anti-hypocretin antibody according to the present 

invention; and 

(2) a hypocretin polypeptide or recombinant hypocretin protein 
of the present invention (capture antigen) that is immunoreactive with the 
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antibody and is operatively linked to a solid matrix such that the competition 
immunoreaction admixture has both a liquid phase and a solid phase. 

(b) The competition immunoreaction admixture is then 
maintained for a time period sufficient for any hypocretin antigen or anti- 

5 hypocretin antibody in the fluid to compete with the admixed antibody molecules 
for immunoreaction with the solid phase capture antigen and form an antibody- 
containing immunoreaction product in the solid phase. 

(c) The amount of antibody present in the product formed in 
step (b) is then determined, thereby determining the presence and amount of 

10 target material in the fluid sample. 

In preferred embodiments, the antibody is operatively linked to an 
indicating means such that the determining in step (c) comprises determining the 
amount of indicating means present in the product formed in step (b). 

Preferably, the fluid sample is provided to a competition 

15 immunoreaction admixture as a known amount of CSF, blood, or a blood derived 
product such as serum or plasma. Further preferred are embodiments wherein 
the amount of immunochemical reagent in the liquid phase of the immunoreaction 
admixture is an excess amount relative to the amount of reagent in the solid 
phase. Typically, a parallel set of competition immunoreactions are established 

20 using a known amount of purified recombinant hypocretin or polypeptide in a 
dilution series so that a standard curve can be developed, as is well known. 
Thus, the amount of product formed in step (c) when using a fluid sample is 
compared to the standard curve, thereby determining the amount of target antigen 
present in the fluid. 

25 In another embodiment, the method for assaying the amount of 

hypocretin in a sample utilizes a first capture antibody to capture and immobilize 
hypocretin in the solid phase and a second indicator antibody to indicate the 
presence of the captured hypocretin antigen. In this embodiment, one antibody 
immunoreacts with a hypocretin protein to form a hypocretin-antibody 

30 immunoreaction complex, and the other antibody is able to immunoreact with the 
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hypocretin while present in the hypocretin-antibody immunoreaction complex. 
This embodiment can be practiced in two formats with the immobilized capture 
antibody being either of the two above-identified antibodies, and the indicator 
antibody being the other of the two antibodies. 

Where an antibody is in the solid phase as a capture reagent, a 
preferred means for determining the amount of solid phase reaction product is by 
the use of a labeled hypocretin polypeptide, followed by the detection means 
described herein for other labeled products in the solid phase. 

Also included are immunological assays capable of detecting the 
presence of immunoreaction product formation without the use of a label. Such 
methods employ a "detection means", which means are themselves well-known in 
clinical diagnostic chemistry and constitute a part of this invention only insofar as 
they are utilized with otherwise novel polypeptides, methods and systems. 
Exemplary detection means include methods known as biosensors and include 
biosensing methods based on detecting changes in the reflectivity of a surface, 
changes in the absorption of an evanescent wave by optical fibers or changes in 
the propagation of surface acoustical waves. 

Alternative methods of expression, amplification, and purification 
will be apparent to the skilled artisan. Representative methods are disclosed in 
Sambrook, Fritsch, and Maniatis, eds. Molecular Cloning, a Laboratory Manual 
2nd Ed., Cold Spring Harbor Laboratory (1989) and in Ausabel et al., eds., 
Current Protocols in Molecular Biology, Wiley & Sons, Inc., New York (1989). 
D. S pecific Methods 

Directional tag PCR subtractive hybridization was used to enrich a 
cDNA library for clones of mRNA species selectively expressed in the 
hypothalamus. Candidate clones identified by their hybridization to a subtracted 
hypothalamus probe were validated in three stages. First, a high throughput 
cDNA library Southern blot was used to demonstrate that the candidate 
corresponded to a species enriched in the subtracted library. Second, candidate 
clones positive in the first assay were used as probes for Northern blots with 
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RNA from several brain regions and peripheral tissues. Finally, candidate clones 
that were still positive were subjected to in situ hybridization analysis to detect 
the hypothalamic regions that express the corresponding mRNAs. 

Typically, subtractive hybridization protocols utilize a single target- 
5 driver dichotomy for enrichment of target-specific species. In the current study, 
a two-step subtraction protocol, first depleting hypothalamus sequences with a 
cerebellum driver, and then with a hippocampus driver, was employed. Previous 
studies using single step subtraction methodology had been successful in finding 
clones of species enriched in a target compared to the single driver tissue, only to 

10 find considerable expression in other brain regions. The present protocol was 
designed to provide a more stringent selection for clones of mRNAs with high 
selectivity for the target. Grids of the subtracted library were prepared and 
probed as described by Usui, H., Falk, J.D., Dopazo, A., de Lecea, L., 
Erlander, M.G., & Sutcliffe, J.G., /. NeuroscL 14:4915-4926 (1994). DNA 

15 sequence analysis, Northern blotting and in situ hybridization were performed as 
described by Usui et ah, supra, and de Lecea, L., Soriano, E., Criado, J.R., 
Steffensen, S.C., Henriksen, S.J., & Sutcliffe, J.G., Molec. Brain Res 25:286- 
296 (1994). 

In situ hybridization analysis was performed essentially as 
20 described by Gall, CM. & Isackson, P.M., Science 245: 758-761 (1989) and by 
Erlander, M.G., et al., Proc. Natl. Acad. ScL USA 90: 3452-3456 (1993). 
Coronal sections about 25 fxm thick cut from brains of adult Sprague-Dawley rats 
were hybridized at 55 degrees Celsius for 16 hours with 35 S-labelled single- 
stranded RNA probes at 10 7 counts per minute per ml. Free-floating sections 
25 were treated with Rnase A at 4 /xg/ml at 37 degrees Celsius for 1 hour and 
washed in lx SSC (15 mM NaCl, 1.5 mM Na citrate), 50% formamide at 
55 degrees Celsius for 2 hours. Final stringency washes were in 0.1 x SSC at 68 
degrees Celsius for 1 hour. Sections were mounted on coated slides, dehydrated 
and exposed to Kodak XAR film for 5 days at room temperature. 
30 For cDNA library Southern blotting, 2 /ug of each library was 
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digested with Haelll, separated by electrophoresis, transferred to nylon 
membranes, and hybridized to individual clones, as described in Usui et al, 
supra. 

To recognize mRNAs that are selectively expressed in the 
5 hypothalamus, poly(A)-enriched cytoplasmic RNA from carefully dissected rat 

and mouse hypothalami were prepared. Target cDNA libraries in vector pT7T3D 
(Pharmacia Biotech, Piscataway, NJ) and driver libraries in pGEMHZf(-) 
(Promega, Madison, WI) from analogously prepared cerebellar and hippocampal 
RNA samples were constructed. The directional tag PCR subtractive 

10 hybridization method of Usui and colleagues in Usui et al., supra was applied to 
produce tagged hypothalamic cDNAs from which cerebellar and hippocampal 
sequences were depleted in two consecutive steps, removing more than 97% of 
the input target cDNA. The tag sequences were used as PCR primer-binding 
sites to amplify the remaining material. An aliquot of the amplified product was 

15 cloned into pBCSK + (Stratagene, La Jolla, CA) to generate a subtracted 

hypothalamus library with 5xl0 5 members, with inserts ranging from 400 to 1200 
(average 700) nucleotide pairs, as judged by agarose gel electrophoresis of the 
released inserts. 

To validate the efficiency of the subtraction, the degree of 

20 depletion in the subtracted library of sequences known to be expressed 

panneurally and the enrichment of sequences known to be expressed specifically 
in the hypothalamus was determined. Dot blots were prepared with dilutions of 
cDNA clones of the mRNAs encoding the following proteins: panneural neuron- 
specific enolase, ubiquitously expressed cyclophilin, hypothalamus-specific 

25 vasopressin, hypothalamus-enriched proopiomelanocortin (POMC), thalamus- 

specific protein kinase CS, and pituitary-specific growth hormone, as well as the 
target vector itself. The blots were probed with cDNA inserts amplified by PCR 
from the unsubtracted target library, the subtracted target library or a pool of the 
driver libraries (Fig. 1). The driver and unsubtracted-library probes gave strong 

30 signals for cyclophilin and neuron-specific enolase, and a weaker signal for 
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POMC. Neither hippocampus nor cerebellum is known to express POMC. 
Although this finding could be explained if one of the drivers had suffered 
contamination with mRNA from another structure, for example brain stem, the 
studies below suggest that the signal with the driver libraries was probably due to 

5 background hybridization to sequences in the POMC clone. The unsubtracted 

target additionally gave a weak signal for vasopressin. The subtracted probe gave 
a very strong signal for vasopressin and POMC and otherwise only faint or 
undetectable signals. The increase in strength of the vasopressin signal was 20- 
to-30 fold. Thus, the subtraction protocol removed abundant, panneurally 

10 expressed sequences nearly quantitatively while enriching for hypothalamus- 

specific sequences. There was no apparent contamination with sequences from 
the anatomically adjacent structures, thalamus or pituitary. The effectiveness of 
the subtraction was quantitated further by measuring the frequencies of VAT-1 
and oxytocin clones in the unsubtracted and subtracted libraries by colony 

15 hybridization with a probe corresponding to a mixture of clones of these two 

species. The frequency of positive clones in the unsubtracted target was 4/2775. 
After subtraction, the frequency increased to 33/1224. These frequencies indicate 
an approximately 19-fold increase in the specific activities of these known 
hypothalamus-enriched species, consistent with the estimates suggested by the 

20 data of Fig. 1. 

To identify species enriched by the subtraction, 648 clones from 
the subtracted library were placed into grid arrays and hybridized to three 
replicate blots of grid images with probes prepared from the unsubtracted or 
subtracted target library, or a pool of the driver libraries. Approximately 70% of 

25 the colonies gave significant signals with the subtracted target probe compared to 
50% with the unsubtracted target probe. Only 10% of the colonies gave signals 
with the mixed-driver probe. 

Plasmid DNA was prepared individually from 100 of the colonies 
that gave strong signals with target-derived probes but no signal with the mixed- 

30 driver probe. Partial sequences of the inserts were determined for 94 of these, 
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using a sequencing primer that annealed to the vector region adjacent to the 3' 
ends of the inserts. The remaining 6 clones were not pursued further because 
clear sequences were not obtained. More than 90% of the 3' sequences appeared 
to be derived from bona fide 3' ends of mRNAs as they contained recognizable 
5 poly(A)-addition consensus hexads (Birnstiel, M.L., Busslinger, M., & Strub, K. 
Cell 41:349-359, 1985) 12-22 nucleotides upstream from the poly(A) tracts used 
in their directional cloning. The sequences were searched by BLAST analysis 
(Altschul, S.F., Gish, W., Miller, W., Myers, E.W., & Lipman, D., J. Molec. 
BioL 215:403, 1990) against the GenBank database. For those that appeared to 

10 be novel, the sequence at the 5' end of the insert was also determined and 
compared with the database. 

A compilation of those data is presented in Table 1 and database 
accession numbers are given for those prototypes for which a match was found. 
The 94 clones from the subtracted library for which data were obtained 

15 corresponded to 43 distinct mRNA species. Twenty-nine of these were 

encountered only once in the set of 94 clones, while 14 species were seen 
between 2 to 13 times. Among the 43 distinct species were 21 that were 
unambiguously matched to known mRNA species and 22 that were novel species. 
Amongst the novel species were 6 that appear to correspond (greater than 80% 

20 nucleotide sequence identity across an extensive span) to rat homologues of so- 
called " expressed sequence tags" (ESTs), mRNAs of as yet unknown function 
compiled in the databases. Two species exhibited similarities in both their partial 
nucleotide sequences and putative encoded amino acid sequences that suggest 
them to represent members of protein families: a protein related to the VAT-1 

25 secretory vesicle protein (clone 6), and a new calmodulin-dependent protein 
kinase (clone 29, SEQ ID NO:5). 

The cDNA insert from at least one representative of each of the 43 
mRNA species was used as a probe in a Southern blot with lanes corresponding 
to the hypothalamus, hippocampus and cerebellum target and driver cDNA 

30 libraries, each cleaved with the restriction endonuclease Hae lll. Assuming that 
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the cDNA libraries are representative of the mRNAs expressed in their 
corresponding tissues, this assay serves as a low cost, high throughput surrogate 
for more expensive and time consuming Northern blot analyses. The 
hybridization results of the clones in this so-called "cDNA library Southern blot" 
5 assay were classified in one of five patterns (Table 1): hybridization to bands 
detected exclusively in the hypothalamus library (A), to bands highly enriched in 
hypothalamus but still detectable in hippocampus and cerebellum lanes (B), to 
bands in hypothalamus and hippocampus but not in cerebellum (C), to bands in 
all three tissues (D), or too faint to categorize (E). Examples of classes A-D are 
10 shown in Fig. 2. Twenty-three of the 43 distinct mRNA species were exclusive 
to or highly enriched in the hypothalamus library, and an additional 15 species 
were undetectable in the cerebellum library, indicating the effectiveness of the 
protocols for identifying species selectively present in the target library. It may 
be significant that the patterns classified as D corresponded to clones that were 
15 isolated only once; similarly, none of the species lacking a poly(A)-addition 
signal turned up more than once. The existence in the collection of species 
present in hippocampal, but not cerebellar, libraries presumably is explained by 
their enrichment during the first subtraction step with cerebellum driver to an 
extent that did not allow their complete depletion in the second step with 
20 hippocampus driver. POMC gave an A pattern in this assay, demonstrating that 
the driver libraries were not significantly contaminated with POMC-expressing 
structures. Thus the low POMC signal observed with the driver probes in Fig. 1 
is mostly likely accounted for by vector cross-hybridization. 

Northern blots were performed for 15 of the species that showed 
25 hypothalamus-enriched or -specific distributions (group A or B) in the cDNA 
Southern blot assay. The blots (Fig. 3) included RNA samples from 6 grossly 
dissected regions of rat brain in addition to pituitary, liver, kidney and heart. 
For the clones of species that had been isolated two or more times, the 
correspondence with the cDNA library Southern blot assay was excellent. Thus, 
30 clones 2 (oxytocin) and 35 (novel), which gave A patterns in the cDNA Southern 
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blot study, each detected a band that was strong in the hypothalamus lanes, but 
only very faint or undetectable in the other lanes. The faint signals were possibly 
due to low expression in those tissues or to contamination during tissue 
dissection. Clones 6 (VAT 1 -like), 10 (novel) and 12 (novel), which had given B 
5 patterns, each detected bands that were considerably more intense in the 

hypothalamus than hippocampus or cerebellum lanes, although each was detected 
in the pituitary lane (6 strongly) and in the samples from some other structures. 
Clones 3 (novel), 15 (novel) and 29 (novel calmodulin-dependent protein kinase), 
although classified originally as B patterns, are more properly considered as C 

10 patterns, as their expression profiles in this assay are not enriched in 
hypothalamus per se, but rather are low in the cerebellum. 

The clones encountered only once behaved, as a group, less well. 
Clones 21 (novel), 37 (novel), 98 (novel) and 99 (kinesin) failed to show 
substantial enrichment in hypothalamus over hippocampus or cerebellum 

15 (although 98 was thalamus enriched). However, clone 33 (novel) detected an 

RNA species more prevalent in hypothalamus and thalamus than cortex, pons or 
olfactory bulb and was undetectable in hippocampus, cerebellum or peripheral 
tissues; thus, technically speaking, clone 33 maintained its A pattern 
classification. Clone 20 (novel) detected an RNA species with ubiquitous 

20 expression but enrichment in hypothalamus and thalamus, thus it is more properly 
classified as B pattern. Clone 67 (novel) detected a species enriched in 
hypothalamus and olfactory bulb that was detectable in other brain regions and 
pituitary but was not detectable in cerebellum. 

In situ hybridization on coronal sections of brain from adult male 

25 rats was performed using the inserts from clones representing all four classes (A- 
D): 6, 10, 20, 21, 29 and 35. For all clones, the hybridization pattern was 
consistent with the Northern blot data. In the A class, the clone 35 mRNA 
displayed a striking pattern of bilaterally symmetric expression restricted to a few 
cells in the paraventricular hypothalamic area and ependymal cells surrounding 

30 the brain ventricles. No clone 35 signals were detected outside the hypothalamus. 
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The sequence of clone 35 is shown in Fig. 5. 

Clones 6, 10 and 20, belonging to class B, displayed somewhat 
more complex distributions. Clone 6 gave strong signals in the periventricular 
hypothalamic nucleus, anterior hypothalamic area, preoptic and arcuate nuclei. 
Very strong hybridization could also be seen in the centromedial thalamic nucleus 
and medial habenula. Clone 10 displayed almost the same pattern but additional 
strong signals could be seen in the laterodorsal thalamic nucleus and dentate 
gyrus, with weak signals in the hippocampal CA fields and the entire neocortex. 
Interestingly this mRNA showed a marked enrichment in basal diencephalic 
structures that included nuclei not only of the hypothalamus but also of the 
amygdaloid complex. Clone 20 exhibited low levels of expression in several 
areas of the brain, but displayed especially strong signals in the ventral 
hypothalamus, most notably in the anterior hypothalamic and periventricular 
nuclei. 

Clone 29 (class C, SEQ ID NO:5), which encodes a novel 
calmodulin kinase-like protein, was also very strongly expressed in the anterior 
hypothalamic area and arcuate nucleus, as well as in the pyramidal cell layer of 
all hippocampal fields and in the medial and central nuclei of the amygdala. The 
sequence of clone 29 is shown in Fig. 6. Clone 21 represents a class D cDNA, 
whose distribution includes hypothalamic as well as extrahypothalamic structures. 
In particular, the clone 21 mRNA was found in cortex, amygdala, hippocampus, 
caudate, and several thalamic (centrodorsal and reticular nuclei) and hypothalamic 
nuclei. Within the hypothalamus, clone 21 mRNA was especially abundant in the 
paraventricular hypothalamic nucleus. 

The data compiled in Table 1 suggest that this strategy was 
effective: 53 of the 94 clones studied were shown to correspond to mRNAs 
expressed in the hypothalamus at much higher concentrations than in either the 
hippocampus or cerebellum. An additional 32 of the clones were enriched in 
both hypothalamus and hippocampus over cerebellum, indicating that the first 
subtraction was more efficient, probably because the target concentration was 
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higher in the hybridization reaction, thus a greater portion of the common species 
were driven into hybrids. Cumulatively, 85 of the 94 candidates were found to 
be enriched in the target hypothalamus compared to the cerebellum, a quite 
acceptable success rate. It is noteworthy that in 8 cases, the cDNA library 
5 Southern blot assay suggested a higher degree of hypothalamus enrichment than 
was later observed by Northern blotting, presumably due to artif actual enrichment 
in the target libraries compared to the driver libraries. In a few cases this can be 
explained by artifactual cloning of an internal or intronic cDNA fragment. Other 
cases may be explained by difficulties in achieving proportional representation of 
10 low prevalence mRNAs in cDNA libraries. 

The subtraction steps provided an approximately 30-fold 
enrichment. In the secondary screen, approximately 60% of the clones were 
positive with the subtracted probe but not the target probe. Of the 94 clones 
selected from this screen, 53 were clones of mRNAs selectively expressed in 
15 hypothalamus. These 53 clones correspond to approximately 1% of the clones 
examined in this pilot study, and represented 16 distinct mRNA species, 
suggesting that a complete characterization of hypothalamus mRNAs might reveal 
100-200 species that were specific to or highly enriched in the hypothalamus. Of 
the 16 mRNA species detected here, 9 corresponded to already known proteins, 
20 among them oxytocin, vasopressin and POMC, three neuropeptides known to be 
highly enriched in the hypothalamus. However, 7 mRNA species were novel. 
Among mRNA species not detected in the 94-clone sample were those encoding 
the releasing factors, which are less abundant than most of the species detected 
here. 

25 Oxytocin and vasopressin mRNAs are predominantly associated 

with discrete hypothalamic nuclei, as was previously known. The in situ 
hybridization images indicate that several additional mRNAs, including several 
novel species, are enriched in the hypothalamus. Among the novel species, only 
clone 35 meets the hypothesis in its strictest sense: the mRNA appears to be 

30 restricted to nuclei in the paraventricular area of the hypothalamus. 
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Other mRNAs corresponding to novel clones exhibit enrichment in 
basal diencephalic structures, especially the hypothalamus, but within the 
hypothalamus none is restricted to a single nucleus. These species presumably 
encode proteins whose functions are not dedicated to single physiological systems. 
5 Nevertheless, their roles seem to have selective utility within the CNS. Previous 
studies looking at mRNAs enriched in the caudate revealed several involved in 
signal transduction pathways (Usui, et al., supra). That is not the finding for the 
hypothalamus-enriched species encountered thus far. 

The data suggest that the hypothalamus utilizes at least two 

10 different strategies for employing selectively expressed proteins. Some specific 

mRNAs are discretely correlated to distinct nuclei. Thus far, all of these mRNAs 
encode secretory signalling proteins. A class of mRNAs have also been 
recognized that are expressed prominently in hypothalamus and amygdala. These 
do not appear to be restricted to functionally discrete regions, but their 

15 comparable anatomical restrictions suggest that they might participate in a series 
of biochemical processes that are selectively distributed to these regions, which 
are developmentally related. Thus these regions may share molecular properties 
that are not apparent at the anatomical level. 

DNA sequence analysis of the complete 569 nucleotide rat clone 35 

20 revealed that the clone mRNA encodes a 130-residue putative secretory protein 
(called H35) or hypocretin with 4 sites for potential proteolytic maturation (Fig. 
5). Several proteolytic fragments have been identified, some replacing C- 
terminal glycines with amide groups. Two of the products of proteolysis have 14 
amino acid identities across 20 residues. This region of H35 includes a 7/7 

25 match with a region of the gut hormone secretin, suggesting that the 

prepropeptide gives rise to two peptide products that are structurally related both 
to each other and to secretin. 

The mouse homolog of clone 35 was also isolated and sequenced 
(Fig. 5). The mouse nucleotide sequence differs in 35 positions relative to the rat 

30 sequence and contains 16 additional nucleotides near its 3' end. Of these 
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differences, 19 nucleotides differ within the protein coding region. Only 7 of 
these affect the encoded protein sequence. One amino acid difference is a neutral 
substitution in the secretion signal sequence (residue 3). The remaining 6 
differences are in the C-terminal region. One of these obliterates a potential 
5 proteolytic cleavage site. This observation and the nature of the other differences 
make it unlikely that 2 of the possible maturation products of the rat 
preproprotein are functional. However, the 2 peptides that are related both to 
each other and to secretin are absolutely preserved between species, providing 
strong support for the notion that these peptides have a function conserved during 
10 evolution. 

The cells that express this mRNA are distributed in a bilaterally 
symmetrical pattern in a previously uncharted nucleus of the rat dorsal-lateral 
hypothalamus and sparse ependymal cells that line the ventricles suggesting that 
the peptides function as intercellular messengers within the CNS. Colocalization 

15 studies suggest a partial overlap with cells positive for galanin, bradykinin and 

dynorphin. The rat H35 mRNA is restricted to the CNS in the studies performed 
to date. It is not expressed at high concentrations in immature animals. 

These observations, along with the sequence data discussed above, 
suggest that the H35 peptides are secreted into the CSF and locally within the 

20 hypothalamus; that their functions are only manifested in mature animals; and that 
their expression is coupled to the general homeostatic status of the animal, 
although not regulated in an all-or-none fashion by homeostasis. In other words, 
these are new hormones that act within the central nervous system. 

The polypeptides may be expressed by transformation of a suitable 

25 host cell with a cDNA in a suitable expression vector. The choice of host cell is 
not critical. The polypeptide may be produced from a procaryotic (e.g. E. coli) 
or eucaryotic (mammalian, e.g. COS-7, CHO, NIH 3T3) host cell, as desired. 

The hypocretin polypeptides, and fragments thereof, of this 
invention are useful in diagnosis and therapy. Recombinant or natural 

30 polypeptides may be used in Western blot, ELISA, RIA, and the like, and in 
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receptor binding assays, for direct or competitive binding studies to identify 
hyprocretin specific receptors. The identification of hypocretin analogs and 
antagonists is also accomplished via use of the polypeptides identified herein. 
Further details of such uses are described in U. S. Patent No. 5,242,798, 

5 incorporated herein by reference. 

In another aspect, the polypeptides of this invention may be used to 
generate antibodies. Methods of preparing polyclonal antibodies are well known 
in the art. For example, an immunogenic conjugate comprising the hypocretin 
protein or a fragment thereof, optionally linked to a carrier protein, is used to 

10 immunize a selected mammal (mouse, rabbit, et al.). Serum from the immunized 
mammal is collected and treated to separate the immunoglobulin fraction. 
Monoclonal antibodies are prepared by standard hybridoma cell technology 
(Koller and Milstein, Nature 256:495-497 (1975)). Briefly, spleen cells are 
obtained from a host animal immunized with an hypocretin protein or fragment. 

15 Hybrid cells are formed by fusing these spleen cells with an appropriate myeloma 
cell line and cultured. The antibodies produced are screened for their ability to 
bind H35 by, for example, ELISA. The cells producing the hypocretin antibody 
are selected. 

Antibodies directed to a conserved epitope common to the hypocretin 
20 polypeptides of several species will detect hypocretin polypeptides of mammalian 
species in general. For example, antibodies directed against such a conserved 
sequence as GNHAAGILT (Fig. 5) can be used to detect human hypocretin 
polypeptides. 

The polynucleotides and polypeptides of this invention may also be 
25 formulated into diagnostic and therapeutic compositions. Representative methods 
of formulation may be found in Remington: The Science and Practice of 
Pharmacy, 19th ed., Mack Publishing Co., Easton, PA (1995). The selection of 
the precise concentration, composition, and delivery regimen is influenced by, 
inter alia, the specific pharmacological properties of the selected compound, the 
30 intended use, the nature and severity of the condition being treated or diagnosed, 
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and the physical condition and mental acuity of the intended recipient. Such 
considerations are within the purview of the skilled artisan. 

Representative delivery regimens include oral, parenteral 
(subcutaneous, intramuscular, and intravenous), rectal, buccal, pulmonary, 
5 transdermal, and intranasal, preferably intravenous. The composition may be in 
solid, liquid, gel, or aerosol form. Generally, the compound will be present in 
an amount from about 1 ^g to about 100 (xg, in a sterile aqueous solution, 
optionally including stabilizers and the like. 

The present invention also describes a diagnostic system, 

10 preferably in kit form, for assaying for the presence of a hypocretin of this 

invention in a body sample, such brain tissue, cell suspensions or tissue sections, 
or body fluid samples such as CSF, blood, plasma or serum, where it is desirable 
to detect the presence, and preferably the amount, of a hypocretin protein in the 
sample according to the diagnostic methods described herein. 

15 In a related embodiment, a nucleic acid molecule can be used as a 

probe (an oligonucleotide) to detect the presence of a gene or mRNA in a cell 
that is diagnostic for the presence or expression of a hypocretin in the cell. The 
nucleic acid molecule probes were described in detail earlier. 

The diagnostic system includes, in an amount sufficient to perform 

20 at least one assay, a subject hypocretin polypeptide, a subject antibody or 

monoclonal antibody, and a subject nucleic acid molecule probe of the present 
invention, as a separately packaged reagent. 

Another embodiment is a diagnostic system, preferably in kit form, 
for assaying for the presence of a hypocretin polypeptide or anti-hypocretin 

25 antibody in a body fluid sample such as for monitoring the fate of therapeutically 
administered hypocretin polypeptide or anti-hypocretin antibody. The system 
includes, in an amount sufficient for at least one assay, a subject hypocretin 
polypeptide and a subject antibody as a separately packaged immunochemical 
reagent. 

30 Instructions for use of the packaged reagent(s) are also typically 



WO 98/05352 



PCT/US97/13657 



included. 

As used herein, the term "package" refers to a solid matrix or 
material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a 

5 polypeptide, polyclonal antibody or monoclonal antibody of the present invention. 
Thus, for example, a package can be a glass vial used to contain milligram 
quantities of a hypocretin polypeptide or antibody or it can be a microliter plate 
well to which microgram quantities of a contemplated polypeptide or antibody 
have been operatively affixed, i.e., linked so as to be capable of being 

10 immunologically bound by an antibody or antigen, respectively. 

"Instructions for use" typically include a tangible expression 
describing the reagent concentration or at least one assay method parameter such 
as the relative amounts of reagent and sample to be admixed, maintenance time 
periods for reagent or sample admixtures, temperature, buffer conditions and the 

15 like. 

A diagnostic system of the present invention preferably also 
includes a label or indicating means capable of signaling the formation of an 
immunocomplex containing a polypeptide or antibody molecule of the present 
invention. 

20 The word "complex" as used herein refers to the product of a 

specific binding reaction such as an antibody-antigen or receptor-ligand reaction. 
Exemplary complexes are immunoreaction products. 

As used herein, the terms "label" and "indicating means" in their 
various grammatical forms refer to single atoms and molecules that are either 

25 directly or indirectly involved in the production of a detectable signal to indicate 
the presence of a complex. Any label or indicating means can be linked to or 
incorporated in an expressed protein, polypeptide, or antibody molecule that is 
part of an antibody or monoclonal antibody composition of the present invention, 
or used separately, and those atoms or molecules can be used alone or in 

30 conjunction with additional reagents. Such labels are themselves well-known in 



WO 98/05352 



PCT/US97/13657 



clinical diagnostic chemistry and constitute a part of this invention only insofar as 
they are utilized with otherwise novel proteins methods and systems. 

The labeling means can be a fluorescent labeling agent that 
chemically binds to antibodies or antigens without denaturing them to form a 
5 fluorochrome (dye) that is a useful immunofluorescent tracer. Suitable 

fluorescent labeling agents are fluorochromes such as fluorescein isocyanate 
(FIC), fluorescein isothiocyante (FITC), 5-diethylamine-l-naphthalenesulfonyl 
chloride (DANSC), tetramethylrhodamine isothiocyanate (TRITC), lissamine, 
rhodamine 8200 sulphonyl chloride (RB 200 SC) and the like. A description of 
10 immunofluorescence analysis techniques is found in DeLuca, 

"Immunofluorescence Analysis", in Antibody As a TooK Marchalonis, et al., 
eds., John Wiley & Sons, Ltd., pp. 189-231 (1982), which is incorporated herein 
by reference. 

In preferred embodiments, the indicating group is an enzyme, such 

15 as horseradish peroxidase (HRP), glucose oxidase, or the like. In such cases 
where the principal indicating group is an enzyme such as HRP or glucose 
oxidase, additional reagents are required to visualize the fact that a receptor- 
ligand complex (immunoreactant) has formed. Such additional reagents for HRP 
include hydrogen peroxide and an oxidation dye precursor such as 

20 diaminobenzidine. An additional reagent useful with glucose oxidase is 2,2'- 
amino-di-(3-ethyl-benzthiazoline-G-sulfonic acid) (ABTS). 

Radioactive elements are also useful labeling agents and are used 
illustratively herein. An exemplary radiolabeling agent is a radioactive element 
that produces gamma ray emissions. Elements which themselves emit gamma 

25 rays, such as 124 I, 125 I, l28 I, ,32 I and 5l Cr represent one class of gamma ray 

emission-producing radioactive element indicating groups. Particularly preferred 
is 125 I. Another group of useful labeling means are those elements such as n C, 
18 F, ls O and 13 N which themselves emit positrons. The positrons so emitted 
produce gamma rays upon encounters with electrons present in the aniinaFs body. 

30 Also useful is a beta emitter, such as m In or 3 H. 
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The linking of labels, i.e., labeling of, polypeptides and proteins is 
well known in the art. For instance, antibody molecules produced by a 
hybridoma can be labeled by metabolic incorporation of radio isotope-containing 
amino acids provided as a component in the culture medium. See, for example, 
5 Galfre et aL, Meth. Enzvmol. . 73:3-46 (1981). The techniques of protein 
conjugation or coupling through activated functional groups are particularly 
applicable. See, for example, Aurameas, et al., Scand. J. Immunol. , Vol. 8 
Suppl. 7:7-23 (1978), Rodwell et al., Biotech. . 3:889-894 (1984), and U.S. Pat, 
No. 4,493,795. 

10 The diagnostic systems can also include, preferably as a separate 

package, a specific binding agent. A "specific binding agent" is a molecular 
entity capable of selectively binding a reagent species of the present invention or 
a complex containing such a species, but is not itself a polypeptide or antibody 
molecule composition of the present invention. Exemplary specific binding 

15 agents are second antibody molecules, complement proteins or fragments thereof , 
S. aureus protein A, and the like. Preferably the specific binding agent binds the 
reagent species when that species is present as part of a complex. 

In preferred embodiments, the specific binding agent is labeled. 
However, when the diagnostic system includes a specific binding agent that is not 

20 labeled, the agent is typically used as an amplifying means or reagent. In these 
embodiments, the labeled specific binding agent is capable of specifically binding 
the amplifying means when the amplifying means is bound to a reagent species- 
containing complex. 

The diagnostic kits of the present invention can be used in an 

25 "ELISA" format to detect the quantity of hypocretin in a sample. "ELISA 1 ' 
refers to an enzyme-linked immunosorbent assay that employs an antibody or 
antigen bound to a solid phase and an enzyme-antigen or enzyme-antibody 
conjugate to detect and quantify the amount of an antigen present in a sample. A 
description of the ELISA technique is found in Chapter 22 of the 4th Edition of 

30 Basic and Clinical Immunology by D.P. Sites et al., published by Lange Medical 
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Publications of Los Altos, CA in 1982 and in U.S. Patents No. 3,654,090; No. 
3,850,752; and No. 4,016,043, which are all incorporated herein by reference. 

In some embodiments, a hypocretin polypeptide, an antibody or a 
monoclonal antibody of the present invention can be affixed to a solid matrix to 
5 form a solid support that comprises a package in the subject diagnostic systems. 

A reagent is typically affixed to a solid matrix by adsorption from 
an aqueous medium although other modes of affixation applicable to proteins and 
polypeptides can be used that are well known to those skilled in the art. 
Exemplary adsorption methods are described herein. 

10 Useful solid matrices are also well known in the art. Such 

materials are water insoluble and include the cross-linked dextran available under 
the trademark SEPHADEX from Pharmacia Fine Chemicals (Piscataway, NJ); 
agarose; beads of polystyrene beads about 1 micron (/x) to about 5 millimeters 
(mm) in diameter available from Abbott Laboratories of North Chicago, IL; 

15 polyvinyl chloride, polystyrene, cross-linked poly aery lamide, nitrocellulose- or 
nylon-based webs such as sheets, strips or paddles; or tubes, plates or the wells 
of a microliter plate such as those made from polystyrene or polyvinylchloride. 

The reagent species, labeled specific binding agent or amplifying 
reagent of any diagnostic system described herein can be provided in solution, as 

20 a liquid dispersion or as a substantially dry power, e.g., in lyophilized form. 
Where the indicating means is an enzyme, the enzyme's substrate can also be 
provided in a separate package of a system. A solid support such as the before- 
described microliter plate and one or more buffers can also be included as 
separately packaged elements in this diagnostic assay system. 

25 The packaging materials discussed herein in relation to diagnostic 

systems are those customarily utilized in diagnostic systems. 

G. Cell Lines Expressing Hypocretin 
The invention also includes a host cell transformed with a 
recombinant DNA (recombinant DNA) molecule of the present invention. The 

30 host cell can be either procaryotic or eucaryotic, although eucaryotic cells are 
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preferred, particularly mammalian cells. Preferred cells are isolated, that is, 
substantially homogeneous and therefore free from other cell types or other cells 
having a hypocretin protein expressed therein. 

A cell expressing a hypocretin of this invention has a variety of 
5 uses according to this invention. Particularly preferred are uses for bulk 
production of hypocretin, for the purpose of providing immunogen for 
production of antibody, for supply of therapeutic protein, for direct binding or for 
screening pharmaceutical compound banks for the presence of hypocretin 
receptor-specific ligands, i.e., in drug screening assays as described herein. 
10 Thus, particularly preferred are cells containing a recombinant DNA molecule 
that expresses a hypocretin protein of this invention. 

In one embodiment, a cell is produced for transplantation into a 
body tissue, thereby expressing hypocretin and providing replacement therapy. 
The cell can be syngeneic, and typically will be a brain tissue-derived cell, such 
15 as a hippocampal cell, neonatal brain tissue cell, glioma and the like neuronal 

tissue cell. Transplantation is accomplished using surgical procedures available to 
a neurosurgeon where the transplantation is to be made into the brain, brain stem 
or other neurological tissues. In preferred embodiments, the cell contains a 
vector for expressing the hypocretin in which the expression means is under the 
20 control of a regulatable promoter, as is well known, such that expression of the 
hypocretin protein can be regulated. 

Eucaryotic cells useful for expression of a hypocretin protein are 
not limited, so long as the cell or cell line is compatible with cell culture methods 
and compatible with the propagation of the expression vector and expression of 
25 the hypocretin protein gene product. Preferred eucaryotic host cells include yeast 
and mammalian cells, preferably vertebrate cells such as those from a mouse, rat, 
monkey or human fibroblastic cell line. Preferred eucaryotic host cells include 
Chinese hamster ovary (CHO) cells available from the ATCC as CCL61, NIH 
Swiss mouse embryo cells NIH/3T3 (ATCC CRL 1658), HELA cells (ATCC 
30 CCL 2), baby hamster kidney cells (BHK), COS-7, COS-1, HEK293 (ATCC 
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CRL 1573), Ltk-1, AV-12 (ATCC CRL 9595), and the like eucaryotic tissue 

culture cell lines. 

Transformation of appropriate cell hosts with a recombinant DNA 

molecule of the present invention is accomplished by well known methods that 
5 typically depend on the type of vector used. With regard to transformation of 

procaryotic host cells, see, for example, Cohen et al., Proc. Natl. Acad. Sci. 

USA , 69:2110 (1972); and Maniatis et al., Molecular Cloning, A Laboratory 

Mammal Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982). 

With regard to transformation of vertebrate cells with vectors 
10 containing recombinant DNAs, see, for example, Graham et al., Virol. , 52:456 

(1973); Wigler et aL, Proc. Natl. Acad. Sci. USA . 76:1373-76 (1979), and the 

teachings herein. 

Successfully transformed cells, i.e., cells that contain a 

recombinant DNA molecule of the present invention, can be identified by well 
15 known techniques. For example, cells resulting from the introduction of an 

recombinant DNA of the present invention can be cloned to clonally 

homogeneous cell populations that contain the recombinant DNA. Cells from 

those colonies can be harvested, lysed and their DNA content examined for the 

presence of the recombinant DNA using a method such as that described by 
20 Southern, J. Mol. Biol. , 98:503 (1975) or Berent et aL, Biotech. , 3:208 (1985). 

In addition to directly assaying for the presence of recombinant 

DNA, successful transformation can be confirmed by well known immunological 

methods when the recombinant DNA is capable of directing the expression of 

hypocretin or by the detection of hypocretin binding activity. 
25 For example, cells successfully transformed with an expression 

vector produce proteins displaying hypocretin antigenicity or biological activity. 

Samples of cells suspected of being transformed are harvested and assayed for 

either hypocretin biological activity or antigenicity. 

In addition to the transformed host cells themselves, the present 
30 invention also includes a culture of those cells, preferably a monoclonal (clonally 
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homogeneous) culture, or a culture derived from a monoclonal culture, in a 
nutrient medium. Preferably, the culture also contains a protein displaying 
hypocretin antigenicity or biological activity. 

Nutrient media useful for culturing transformed host cells are well 
5 known in the art and can be obtained from several commercial sources. In 

embodiments wherein the host cell is mammalian, a " serum-free " medium can be 
used. 

H. Screening Methods to Identify 

Agonists and Antagonists of Hvpocretin 

10 

The ability to selectively bind/modulate function of a hypocretin 
receptor by a hypocretin ligand is at the heart of useful hypocretin pharmacology, 
and depends on identifying pharmacological molecules which can act a selective 
ligands, agonists or antagonists for a hypocretin receptor. To that end, the 

15 elucidation of new hypocretin proteins, such as those described herein, provides 
valuable tools for the search for selective reagents, tools that are useful in binding 
assays, and in screening assays which indicate selective drug response to the 
hypocretin receptor. 

The invention includes methods for determining whether a 

20 molecule binds to, and preferably whether the molecule activates, a preselected 
hypocretin receptor. 

The method comprises conducting a binding assay to identify 
molecules which bind the hypocretin receptor, as described in any of the assays 
herein. Thus, the method comprises (1) contacting a candidate molecule with a 

25 cell having a hypocretin receptor under conditions permitting binding of 
hypocretin to the receptor, and (2) detecting the presence of the candidate 
molecule bound to the hypocretin receptor, thereby determining whether the 
candidate binds to the receptor. The receptor is typically a cell surface protein 
when expressed by the cells. 

30 Alternatively, one can use a competition format to identify analogs 

of hypocretin by using a labeled hypocretin, and measuring the amount of bound 
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label in the presence of a candidate ligand, indicating whether the candidate 
competes with labeled hypocretin for binding to the receptor. An exemplary 
competition assay is described herein. 

It is also possible to use the above method to determine whether 
5 the molecule which binds to the hypocretin receptor also activates or motivates 
the receptor's function, i.e., acts as an agonist, or determine whether the 
molecule inhibits the receptor's function, i.e., acts as an antagonist, or acts as 
and inverse agonist. Thus, by evaluating in the detecting step whether the 
hypocretin receptor is activated, one determines whether the candidate molecule is 
10 bioactive. 

Methods for detecting bioactivity of the candidate molecule can 
vary, but typically involve measuring changes in intracellular levels of a 
secondary messenger effected as a result of binding, detecting changes in 
electrical potential, observing physiological or behavioral effects related to 
15 hypocretin function, and the like methods. Exemplary assays for binding or for 
hypocretin-specific bioactivity are described in the Examples and include 
measurement of electrical changes of hypothalamic neurons, measurement of food 
intake or body temperature, or direct binding to a cell having a hypocretin 
receptor. 

20 It is noted that the hypocretin receptor has not been characterized 

in extensive detail. Thus, any receptor that binds hypocretin can be referred to as 
a hypocretin receptor for the purposes of a screening assay, although receptors 
with the highest affinity and specificity for hypocretin are preferred. In 
practicing the present screening methods, one can use any of a variety of cells 

25 lines or tissues that possess a hypocretin receptor, including the exemplary cell 
lines and tissues described herein. The invention should not be construed as 
limiting so long as the binding or bioactivity assay involves the use of a 
hypocretin receptor. In preferred embodiments, a receptor that is specific for 
hypocretin should be used. Specificity can be demonstrated by well known 

30 methods of ligand binding and ligand-mediated activation. 
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A related embodiment includes a method for screening to identify a 
candidate molecule that can bind, inhibit or activate a preselected hypocretin 
receptor by functioning as a hypocretin agonist or antagonist. The method 
comprises: 

5 (a) contacting a mammalian cell with said candidate drug under 

conditions permitting activation of said hypocretin receptor by hypocretin; and 
(b) detecting the activation status of said hypocretin receptor, 
and thereby determining whether the drug activates or inhibits said receptor. 

I. Methods for Altering Hypocretin Receptor Function 
10 a. Therapeutic Methods 

The certain reagents described in the present invention have 
the capacity to modulate hypocretin receptor function, such as agonists or 
antagonists, and therefore are useful in therapeutic methods for conditions 
mediated by the hypocretin receptor. 
15 Hypocretin polypeptides that mimic exposed regions of hypocretin 

have the ability to function as analogs and compete for binding to the hypocretin 
receptor, or for other agents that would normally interact with the receptor, 
thereby inhibiting binding of hypocretin to the receptor. 

Furthermore, antibodies and monoclonal antibodies of the present 
20 invention that bind to exposed regions of hypocretin have the capacity to alter 

hypocretin receptor function by blocking natural interactions with hypocretin that 
normally interact at the site. Exemplary antibodies are the anti-hypocretin 
antibodies described earlier. 

Finally, oligonucleotides are described herein which are 
25 complementary to mRNA that encodes a hypocretin protein of this invention and 
that are useful for reducing gene expression and translation of the hypocretin 
mRNA, thereby altering hypocretin levels in a tissue. 

In one embodiment, the present invention provides a method for 
modulating hypocretin function in an animal or human patient comprising 
30 administering to the patient a therapeutically effective amount of a physiologically 
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tolerable composition containing a hypocretin polypeptide, analog or 
peptidomimetic, anti-hypocretin antibody or monoclonal antibody, hypocretin 
agonist or antagonist, or an oligonucleotide of the present invention. 

A therapeutically effective amount of a hypocretin polypeptide, as 
5 an example for practicing the invention, is a predetermined amount calculated to 
achieve the desired effect, i.e., to modulate receptor interaction with its normal 
target, and thereby interfere with normal receptor function. Depending on the 
structure of the particular peptide the binding of some peptides will activate the 
receptor, while binding of other peptides will not activate the receptor. 

10 Similarly, a therapeutically effective amount of an anti-hypocretin 

antibody is a predetermined amount calculated to achieve the desired effect, i.e., 
to immunoreact with the hypocretin, and thereby inhibit the hypocretin receptor's 
ability to interact with its normal target, hypocretin, and thereby interfere with 
normal receptor function. 

15 The in vivo inhibition of hypocretin receptor function using a 

hypocretin polypeptide, an anti-hypocretin antibody, or hypocretin agonist or 
antagonist of this invention is a particularly preferred embodiment and is 
desirable in a variety of clinical settings, such as where the patient is exhibiting 
symptoms of an over or under activated hypocretin receptor. 

20 A therapeutically effective amount of a hypocretin polypeptide, 

agonist or antagonist of this invention is typically an amount such that when 
administered in a physiologically tolerable composition is sufficient to achieve a 
plasma concentration of from about 0.1 nanomolar (nM) to about 100 nM, and 
preferably from about 0.5 nM to about 10 nM. 

25 A therapeutically effective amount of an antibody of this invention 

is typically an amount of antibody such that when administered in a 
physiologically tolerable composition is sufficient to achieve a plasma 
concentration of from about 0.1 microgram (iig) per milliliter (ml) to about 100 
/Ag/ml, preferably from about 1 fig/m\ to about 5 /xg/ml, and usually about 5 

30 /xg/ml. 
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The effectiveness of the therapy can be determined by observing 
ablation of the symptoms associated with the function of the hypocretin receptor 
being inhibited. 

The therapeutic compositions containing a hypocretin polypeptide, 
5 agonist, antagonist or anti-hypocretin antibody of this invention are conventionally 
administered intravenously or by a method for delivery to a brain tissue, as by 
injection of a unit dose, for example. The term "unit dose 1 ' when used in 
reference to a therapeutic composition of the present invention refers to physically 
discrete units suitable as unitary dosage for the subject, each unit containing a 
10 predetermined quantity of active material calculated to produce the desired 

therapeutic effect in association with the required diluent; i.e., carrier, or vehicle. 

Delivery to a brain tissue or CSF can be accomplished by a variety 
of means, including by direct injection, by use of a cannula into the target tissue, 
by direct application in a surgical procedure, by adsorption across the blood-brain 
15 barrier following intravenous administration, by viral vectors, and the like means. 

The therapeutic compounds and compositions are generally 
administered so as to contact the cells or the tissue containing cells which contain 
the target hypocretin receptor. This administration can be accomplished by 
introduction of the composition internally such as orally, intravenously, 
20 intramuscularly, intranasally or via inhalation of aerosols containing the 

composition, and the like, by cannula into a brain tissue, or by introduction into 
or onto a tissue system as by introduction transdermally, topically or 
intralesionally, in suppositories, or by intra-orbital injection, and the like. 

The compositions are administered in a manner compatible with the 
25 dosage formulation, and in a therapeutically effective amount. The quantity to be 
administered depends on the subject to be treated, capacity of the subject's system 
to utilize the active ingredient, and degree of therapeutic effect desired. Precise 
amounts of active ingredient required to be administered depend on the judgement 
of the practitioner and are particular to each individual. However, suitable 
30 dosage ranges for systemic application are disclosed herein and depend on the 
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route of administration. Suitable regimes for initial administration and booster 
shots are also variable, but are typified by an initial administration followed by 
repeated doses at one or more hour intervals by a subsequent injection or other 
administration. Alternatively, continuous intravenous infusion sufficient to 
5 maintain concentrations in the CSF or blood in the ranges specified for in vivo 
therapies are included. 

As an aid to the administration of effective therapeutic amounts of 
a hypocretin polypeptide, agonist, antagonist, antibody, or monoclonal antibody, 
(hereinafter a "therapeutic agent") a diagnostic method of this invention for 
10 detecting a therapeutic agent in the subject's CSF or blood is useful to 

characterize the fate of the administered therapeutic agent. Suitable diagnostic 
(monitoring) assays are described herein. 

b. Methods for Inhibiting Gene Expression 

In another embodiment, the invention includes the use of 
15 nucleic acids encoding portions of a hypocretin gene for inhibiting gene 
expression and function. 

The present invention provides for a method for inhibiting 
expression of hypocretin gene products and thereby inhibiting the function of the 
target hypocretin protein. The DNA segments and their compositions have a 
20 number of uses, and may be used in vitro or in vivo . In vitro , the compositions 
may be used to block function and expression of hypocretin in cell cultures, 
tissues, organs and the like materials that can express hypocretin. In vivo, the 
compositions may be used prophylactically or therapeutically for inhibiting 
expression of a hypocretin gene, and by inhibiting diseases or medical conditions 
25 associated with the expression or function of the hypocretin gene or the activity 
state of its receptor. 

The method comprises, in one embodiment, contacting cells or 
tissues with a therapeutically effective amount of a pharmaceutical^ acceptable 
composition comprising a DNA segment of this invention. In a related 
30 embodiment, the contacting involves introducing the DNA segment composition 
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into cells expressing a hypocretin protein. 

The DNA segment can be in a variety of forms, but is preferably 
in a single-stranded form to facilitate complementary hybridization to the target 
mRNA in the cell in which the hypocretin gene expression is to be altered. 
5 The term "cells" is intended to include a plurality of cells as well 

as single cells. The cells can be isolated, or can be cells that form a larger 
organization of cells to form a tissue or organ. 

Another embodiment is a method of inhibiting the expression of 
hypocretin genes in a patient comprising administration to the patient of a 
10 therapeutically effective amount of a DNA segment composition of this invention 
in a pharmaceutically acceptable excipients. In cases where the distribution of the 
hypocretin is believed to be disseminated in the body, the administration of 
therapeutic oligonucleotide can be systemic. Alternatively, the target hypocretin 
can be localized to a tissue, and the therapeutic method can likewise be directed 
15 at delivering the therapeutic DNA segment to the tissue to be treated. 

The concentration of the active DNA segment ingredient in a 
therapeutic composition will vary, depending upon the desired dosage, use, 
frequency of administration, and the like. The amount used will be a 
therapeutically effective amount and will depend upon a number of factors, 
20 including the route of administration, the formulation of the composition, the 

number and frequency of treatments and the activity of the formulation employed. 

The use of therapeutic DNA segments, and therefore the delivery 
of those DNA segments into cells where they are effective, has been described in 
a variety of settings. It is generally known that therapeutically effective 
25 intracellular levels of nucleic acids, and particularly smaller nucleic acids such as 
DNA segments and oligonucleotides, can be achieved by either exposing cells to 
solutions containing nucleic acids or by introduction of the nucleic acids into the 
inside of the cell. Upon exposure, nucleic acids are taken up by the cell where 
they exert their effectiveness. In addition, direct introduction into the cell can be 
30 provided by a variety of means, including microinjection, delivery by the use of 
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specific uptake vehicles, and the like. 

The pharmaceutical composition containing the therapeutic 
oligonucleotide preferably also contains physiologically acceptable carriers, in 
particular hydrophobic carriers which facilitate carrying the oligonucleotide 
5 through the cell membrane or blood brain barrier. 

Exemplary descriptions of the delivery of therapeutic DNA 
segments and oligonucleotides into cells can be found in the teachings of United 
States Patent Nos. 5,04,820, 4,806,463, 4,757,055, and 4,689,320, which 
teachings are hereby incorporated by reference. 

10 A therapeutically effective amount is a predetermined amount 

calculated to achieve the desired effect, i.e., to bind to a hypocretin gene present 
and thereby inhibit function of the gene. 

As is apparent to one skilled in the art, the copy number of a 
hypocretin gene may vary, thereby presenting a variable amount of target with 

15 which to hybridize. Thus it is preferred that the therapeutic method achieve an 
intracellular concentration of a therapeutic DNA segment of this invention in 
molar excess to the copy number of the gene in the cell, and preferably at least a 
ten- fold, more preferably at least a one-hundred fold, and still more preferably at 
least a one thousand-fold excess of therapeutic DNA segments relative to the gene 

20 copy number per cell. A preferred effective amount is an intracellular 

concentration of from about 1 nanomolar (nM) to about 100 micromolar OxM), 
particularly about 50 nM to about 1 /xM. 

Alternatively, a therapeutically effective amount can be expressed 
as an extracellular concentration. Thus it is preferred to expose a cell containing 

25 a hypocretin gene to a concentration of from about 100 nM to about 10 

millimolar (mM), and preferably about 10 fjM to 1 mM. Thus, in embodiments 
where delivery of a therapeutic DNA segment composition is designed to expose 
cells to the nucleic acid for cellular uptake, it is preferred that the local 
concentration of the DNA segment in the area of the tissue to be treated reach the 

30 extracellular concentrations recited above. 
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For patient dosages, using a 20 nucleotide base double-stranded 
DNA segment as the standard, a typical dosage of therapeutic composition for a 
70 kilogram (kg) human contains in the range of about 0.1 milligram (mg) to 
about 1 gram of 20-mer DNA segment per day, and more usually in the range of 
5 about 1 mg to 100 mg per day. Stated differently, the dosage is about 1 /xg/kg/g 
day to about 15 mg/kg/day, and preferably about 15 to 1500 /xg/kg/day. 

The in vivo inhibition of hypocretin gene expression and function 
by a therapeutic composition of this invention is desirable in a variety of clinical 
settings, such as where the patient is at risk for disease based on expression of 
10 the hypocretin gene. 

c. Therapeutic Compositions 

The present invention includes therapeutic compositions useful for 
practicing the therapeutic methods described herein. Therapeutic compositions of 
the present invention contain a physiologically tolerable carrier together with a 
15 therapeutic reagent of this invention, namely a hypocretin polypeptide, an anti- 
hypocretin antibody or monoclonal antibody, or oligonucleotide as described 
herein, dissolved or dispersed therein as an active ingredient. In a preferred 
embodiment, the therapeutic composition is not immunogenic when administered 
to a mammal or human patient for therapeutic purposes. 
20 As used herein, the terms M pharmaceutical^ acceptable", 

"physiologically tolerable" and grammatical variations thereof, as they refer to 
compositions, carriers, diluents and reagents, are used interchangeably and 
represent that the materials are capable of administration to or upon a mammal 
without the production of undesirable physiological effects such as nausea, 
25 dizziness, gastric upset and the like. 

The preparation of a pharmacological composition that contains 
active ingredients dissolved or dispersed therein is well understood in the art. 
Typically such compositions are prepared as injectables either as liquid solutions 
or suspensions, however, solid forms suitable for solution, or suspensions, in 
30 liquid prior to use can also be prepared. The preparation can also be emulsified. 
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The active ingredient can be mixed with excipient which are 
pharmaceutical^ acceptable and compatible with the active ingredient and in 
amounts suitable for use in the therapeutic methods described herein. Suitable 
excipient are, for example, water, saline, dextrose, glycerol, ethanol or the like 
5 and combinations thereof. In addition, if desired, the composition can contain 
minor amounts of auxiliary substances such as wetting or emulsifying agents, pH 
buffering agents and the like which enhance the effectiveness of the active 
ingredient. 

The therapeutic composition of the present invention can include 
10 pharmaceutical^ acceptable salts of the components therein. Pharmaceutically 

acceptable salts include the acid addition salts (formed with the free amino groups 
of the polypeptide) that are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, tartaric, 
mandelic and the like. Salts formed with the free carboxyl groups can also be 
15 derived from inorganic bases such as, for example, sodium, potassium, 
ammonium, calcium or ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine and the 
like. 

Physiologically tolerable carriers are well known in the art. 

20 Exemplary of liquid carriers are sterile aqueous solutions that contain no 

materials in addition to the active ingredients and water, or contain a buffer such 
as sodium phosphate at physiological pH value, physiological saline or both, such 
as phosphate-buffered saline. Still further, aqueous carriers can contain more 
than one buffer salt, as well as salts such as sodium and potassium chlorides, 

25 dextrose, polyethylene glycol and other solutes. 

As described herein, for intracellular delivery of oligonucleotides, 
specialized carriers may be used which facilitate transport of the oligonucleotide 
across the cell membrane. These typically are hydrophobic compositions, or 
include additional reagents which target delivery to and into cells. 

30 Liquid compositions can also contain liquid phases in addition to 
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and to the exclusion of water. Exemplary of such additional liquid phases are 
glycerin, vegetable oils such as cottonseed oil, and water-oil emulsions. 

A therapeutic composition contains an amount of a hypocretin 
polypeptide or anti-hypocretin antibody molecule of the present invention 

5 sufficient to inhibit hypocretin function. Typically this is an amount of at least 
0.1 weight percent, and more preferably is at least 1 weight percent, of peptide 
or antibody per weight of total therapeutic composition. A weight percent is a 
ratio by weight of peptide or antibody to total composition. Thus, for example, 
0.1 weight percent is 0.1 grams of polypeptide per 100 grams of total 

10 composition. 

The following Examples are illustrative of one means of practicing 
certain aspects of the invention disclosed herein and should not be construed so as 
impart any undue limitations upon the invention as claimed below. 

EXAMPLE 1 

15 Young adult Sprague-Dawley rats of both genders were sacrificed under 

anesthesia by decapitation and their brains quickly removed. The hypothalamus, 
hippocampus, and cerebellum were immediately dissected on an ice-cold plate 
following the boundaries described by Glowinski and Iversen (Glowinski, J., & 
Iversen, L.L., J. Neurochem. 13:655-669, 1966). The block of hypothalamic 

20 tissue was 2 mm deep and was taken using the optic chiasm as the rostral limit 
and the mammillary bodies as caudal reference. Cytoplasmic RNA was isolated 
rapidly from the dissected tissues (Schibler, K., Tosi, M., Pittet, A.C., Fabiani, 
L., & Wellauer, P.K., /. MoL Biol. 142:93-116, 1980) and enriched for poly(A)- 
containing species by oligo(dT)-cellulose chromatography (Aviv, H., & Leder, 

25 P., Proc. Natl Acad. ScL USA 69:1408-1412, 1972). For the Northern blots, 

RNA was isolated (Chirgwin, J.M., Przybyla, A.E., MacDonald, R.J., & Rutter, 
W.J., Biochemistry 18:5294, 1979) from frozen tissue purchased from Zivic- 
Miller (Zelienople, PA). cDNA libraries were prepared as described previously 
Usui et al, supra, except that pBCSK + was used for the subtracted library rather 
30 than pT7T3D because lower backgrounds have been found in the subsequent steps 
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using the former vector (H. Usui, personal communication). The number of 
recombinants in the libraries were: pT7T3D hypothalamus 8xl0 6 ; cerebellum 
pGEMUZf (-) 5 x 10 5 ; hippocampus pGEMllZf(-) 1 x 10 6 . 

Subtractive hybridization was performed in two cycles using the 
5 previously described procedure (Usui et al., supra). Briefly, 1 fxg of trace- 
labeled, tagged hypothalamus target cDNA prepared as described from the 
pT7T3D target library was annealed for 24 hrs at 68 degree Celsius in 10 /xl of 
hybridization buffer (Usui et al, supra) with 20 fxg cerebellum cRNA (ratio 1:20). 
After hydroxy apatite chromatography, the single-stranded fraction corresponded 

10 to 10% of the input material, as judged by tracer quantitation. This was mixed 
with 20 /xg of hippocampus cRNA (estimated ratio 1:200) for a second 24 hr 
hybridization, after which 30% of the input chromatographed at the single-strand 
position. Cumulatively, these steps removed more than 97% of the input tracer. 
An aliquot of the single-stranded material was used as template in a 30-cycle PCR 

15 (program: 94 degree Celsius for 15 sec, 60 degree Celsius for 15 sec, 72 degree 
Celsius for 1 min) using primers corresponding to the tag sequences (Usui et al., 
supra): 5 ' - AACTGG A AG A ATTCGCGG-3 ' and 5'- 

AGGCC A AG A ATTCGGC ACG A-3 ' . The amplification product was cleaved 
with NotI, then EcoRI, and inserted into pBCSK + . A dot blot was prepared and 

20 screened with probes prepared from the target, subtracted target and driver 

libraries as previously described by Usui et al, supra, using serial dilutions of 
plasmid cDNA clones isolated previously in this laboratory. The target and 
subtracted target cDNA libraries were screened to determine the frequency of 
oxytocin and VAT-1 cDNA clones using as probes clones isolated in the present 

25 study. 

Clone 35 cDNA from the subtracted rat hypothalamus library was 
used as a probe to screen a rat brain cDNA library in the plasmid pHG327 as 
described by Forss-Petter et aL, J. MoL Neurosci. 1:63-75 (1989). The cDNA 
library was constructed as described by Staeheli et al., Cell 44:147-158 (1986). 
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EXAMPLE 2 

Similarly, following the procedures of Example 1, a mouse 
(C57/B16) hypothalamus cDNA library, constructed in the pT7T3D vector, was 
used as a template for PCR amplification (primers 5' 

5 TAAGACGACGGCCTCAG 3' and 5' CACACCAACAGAGAAACG 3') to 

obtain the mouse homolog of the rat H35 cDNA obtained above. The mouse and 
rat cDNA and protein sequences are compared in Fig. 5. The 569 nucleotide rat 
sequence has the potential to encode a 130-residue putative secretory protein 
(preprohypocretin) with an apparent signal sequence and 3 additional sites for 

10 potential proteolytic maturation (Fig. 5A). Two of the putative products of 
proteolysis (hcrtl and hcrt2) have 14 amino acid identities across 20 residues 
(Figs. 5B). This region of one of the peptides contains a 111 match with secretin 
(Fig. 5B), suggesting that the prepropeptide gives rise to two peptide products 
that are structurally related both to each other and to secretin. 

15 The mouse hypocretin nucleotide sequence differs in 35 positions 

relative to the rat, and contains 16 additional nucleotides near its 3' end. Of 
these differences, 19 are within the putative protein-coding region (Fig. 5A), only 
7 of which affect the encoded protein sequence: one amino acid difference at 
residue 3 is a neutral substitution in the apparent secretion signal sequence; the 

20 remaining 6 differences are near the C-terminus, one of which obliterates a 

potential proteolytic cleavage site. The absence of this site and the nature of the 
other differences make it unlikely that two of the four possible rat maturation 
products are generated and functional in mice. However, the two putative hcrt 
peptides that are related both to each other and to the secretin family are 

25 absolutely preserved between the two species, providing strong support for the 

notion that these peptides have a function conserved during evolution. Both hcrtl 
and hcrt2 terminate with glycine residues, which typically are substrates for , 
leaving the nitrogen of the terminal glycine as a C-terminal amide in the mature 
peptide. 

30 Several hypocretin peptides are distinguished within the sequence of 
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hypocretin (Fig. 5). The peptide from about amino acid residue 28 to about 
amino acid residue 130 (SEQ ID NO: 6) represents the peptide produced by 
cleavage of the signal peptide. The peptide from about amino acid residue 28 to 
about amino acid residue 66 (SEQ ID NO:7) corresponds to hcrtl. The peptide 

5 from about amino acid residue 28 to about amino acid residue 65 (SEQ ID NO: 8) 
corresponds to hcrtl matured by peptidylglycine alpha-amidating monooxygenase, 
leaving the nitrogen of the terminal glycine as a C-terminal amide in the mature 
peptide. The peptide from about amino acid residue 70 to about amino acid 
residue 97 (SEQ ID NO:9) corresponds to hcrt2. The peptide from about amino 

10 acid residue 70 to about amino acid residue 96 (SEQ ID NO: 10) corresponds to 
hcrt2 matured by peptidylglycine alpha-amidating monooxygenase, leaving the 
nitrogen of the terminal glycine as a C-terminal amide in the mature peptide. 
The peptide from about amino acid residue 47 to about amino acid residue 66 
(SEQ ID NO: 11) corresponds to the consensus sequence region of hcrtl (Fig. 

15 5B). The peptide from about amino acid residue 78 to about amino acid residue 
97 (SEQ ID NO: 12) corresponds to the consensus region of Hrct2. The peptide 
GNHAAGILT (SEQ ID NO: 13) is common to both hcrtl and hcrt2. 

EXAMPLE 3 

Rat H35 (SEQ ID NO:3) is inserted into the BamHl sites of a 
20 pHG237 vector. Upon digestion with BamHl restriction enzyme, the resultant 
569 bp fragment is then inserted directly into the Bglll site of the polylinker 
region of the pCM 4 vector (D. Russell, U. Texas Southwestern Medical Center, 
Dallas, TX), which uses the cytomegalovirus (CMV) promoter. Several eight to 
ten amino acid epitope tags are added by PCR to the C-terminus of H35 to allow 
25 visualization of the expressed product. 

The respective 5' and 3' primers, 5' 
ATCGAGATCTAGACACCATGAACCTTCCTTCTACAAAGGTT 3' and 5' 
ACTGTCTAGATCATAGATCTTCTTCAGAAATAAGTTTTTGTTCGACTCTG 
GATCCGCCCCGGGGCGCT 3', are used as primers to amplify H35 beginning 
30 at position 85 in SEQ ID NO:3 with an inserted Bglll site added at its 5' end to 
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the 3' end having an inserted c-myc epitope tag. The PCR products are 
subcloned into pCMV and transfected into a mammalian host cell to produce an 
H35-myc tagged protein product. 

H35 proteins are also produced in bacteria by subcloning the H35 
5 coding sequence into pRSET B (Invitrogen, San Diego, CA), which encodes six 
histidines prior to the H35 sequence. The vector contains a T7 promoter which 
drives expression of 6XHis-tagged proteins in E, coli. The respective 5 'and 3' 
oligonucleotides 5' ATCGAGATCTCTTGGGGTGGACGCGCAGCCT 3' and 5' 
ACTGAATTCTCAGACTCTGGATCCGCCCCG 3' are used as PCR primers to 

10 amplify the rat H35 sequence into the Bglll and EcoRI sites of the pRSET B 
vector. The resulting hypocretin-poly-(His) fusion protein may be purified by 
affinity chromatography on a metal affinity resin. 

An H35-glutathione-S-transferase fusion protein is produced in E. 
coli by subcloning the H35 sequence into a pGEX2 vector (Pharmacia). 

15 EXAMPLE 4 

The mouse Hcrt gene was mapped to Chromosome 1 1 using an 
interspecific backcross. A single-strand sequence polymorphism between 
C57BL/6J and SPRET/Ei was detected as previously described and mapped on 
The Jackson Laboratory BSS panel. An //err-specif ic product of approximately 

20 600 base pairs was amplified from mouse C57BL/6J genomic DNA using 
synthetic oligonucleotides 5'-GACGGCCTCAGACTTCTTGG-3' and 5 1 - 
GCAAC AGTTCGTAGAGACGG-3 ' . This product contained a putative intron, 
and its identity as hcrt was confirmed by sequencing (data not shown). Genotype 
data and references for these and other linked markers can be accessed via the 

25 Mouse Genome Database (http://www.informatics.jax.org). 

No recombinants in 94 BSS mice were found between hcrt and the 
previously mapped loci Brcal, Tubg and Mpmv8, placing Hcrt maximally within 
3.8 cM (95% confidence limit) of these genes. The Hoxb cluster is 
approximately 1 cM centromeric to Hcrt, and the Kcnj2 gene is located 

30 approximately 4 cM telomeric. Hcrt is located in the portion of mouse 
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0^ 

Chromosome 11 that shows conserved synteny with human Chromosome 17q21- 
q24. 

In Northern blot studies using poly(A) + RNA prepared from brain 
and different peripheral tissues, the 700-nucleotide hypocretin mRNA was 
5 detected only in brain samples. Previous studies with RNA from different 
regions of the brain had detected the hypocretin mRNA predominantly in 
hypothalamus samples. In samples of RNA from whole brains of developing 
rats, hypocretin mRNA was detected at low concentrations as early as embryonic 
day 18, but increased in concentration dramatically after the third postnatal week. 
10 There was no detectable difference between brain samples from adult males and 
females, suggesting that the late onset was not related to sexually dimorphic 
processes. In sita hybridization studies detected cell bodies in the dorsal-lateral 
hypothalamus and in cells that line the ventricles. 

EXAMPLE 5 

15 A polyclonal antiserum (serum 2050) was raised to a chemically 

synthesized peptide corresponding to the C-terminal 17 amino acid residues 
(CPTATATACAPRGGSRV) of the rat preprohypocretin sequence. In Western 
transfer blots using as target electrophoretically separated proteins from bacteria 
transformed with the plasmid pRSET B engineered to express preprohypocretin, a 

20 single prominent immunoreactive band was observed with a migration of 

approximately 19kDa with the hyperimmune serum, but not with the preimmune 
serum. No immunoreaction was detected with an extract from bacteria 
transformed with a preprohypocretin/pRSET B expression plasmid, indicating that 
detection of the 19kDa target requires hypocretin expression. Analogous results 

25 were obtained with an additional antiserum to the 17mer and two antisera to 
synthetic hcrt2. 

In immunohistochemical studies with antiserum 2050 on sections 
from perfused adult male rats, immunoreactive cell bodies were observed 
exclusively in the perifornical nucleus and dorsal and lateral hypothalamic areas, 
30 consistent with the in situ hybridization results (Fig. 4). This coincident staining, 
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its elimination when the serum was preincubated with the peptide immunogen, 
and the very low nonspecific background observed, together with the Western 
blot results, provided strong evidence for the specificity of the antiserum for 
hypocretin. In addition to cell bodies, the serum detected a prominent network of 
5 fibers located within the hypothalamus, particularly the posterior region. Less 
prominent fiber projections were observed in apparent terminal fields within the 
preoptic area, the medial dorsal and reuniens nuclei of the thalamus, the dorsal 
raphe nucleus, the locus coeruleus, the laterodorsal tegmental nucleus, the central 
gray, the colliculi and the nucleus of the solitary tract. In immuno-electron 
10 microscopy studies, immunoreactive secretory vesicles were observed. 

EXAMPLE 6 

The putative structures of the hypocretins, their expression within 
the dorsolateral hypothalamus and accumulation within fibers and vesicles 
suggested that they may have intercellular signaling activity. To test this 

15 hypothesis, 10-day cultures of synaptically-coupled rat hypothalamic neurons 
were prepared and postsynaptic currents were recorded under voltage clamp. 
Application of a synthetic peptide corresponding to amidated hcrt2 at IfxM evoked 
a substantial, but reversible, increase in the frequency of postsynaptic currents in 
75% of the neurons tested (Fig. 7A), indicating an increase in the activity of 

20 presynaptic axons, and suggesting an increase in excitation. The other 25% of 

the cells showed no response to hcrt2. There was little response by hypothalamic 
neurons that had been in culture for only 3-5 days, suggesting that a certain 
degree of synaptic maturity was required for the effect. Hcrt2 elicited no 
response from synaptically coupled hippocampal dentate granule neurons in 

25 culture, demonstrating target selectivity and suggesting that specific receptors for 
hcrt2 may exist. 

EXAMPLE 7 

Synthetic, amidated hcrt2 peptide at different concentrations was 
infused intracerebroventricularly in rats and body temperature by was monitored 
30 telemetry. Stereotactic ablation studies have previously implicated the dorsal- 
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lateral hypothalamus in feeding behavior, blood pressure, and central regulation 
of immune function, although precise nuclei have not been correlated with these 
activities. A threshold-type response was obtained in which, at the highest dose, 
10 /xg, body temperature dropped from 37.7 to 36.7 degrees Celsius over 30 
5 minutes following administration, then recovered to normal over 2 hours. Food 
intake was monitored over 2 hours following administration and a 40% reduction 
in food intake was measured at a dose of 5/*g. Whereas the concentrations of 
peptide required for an effect might seem high, they are comparable to the doses 
of leptin administered ICV to obtain a comparable suppression of food intake. 

10 The presumed target cells of hypocretin may not be very accessible by this 
unphysiological mode of administration. Local injection or intravenous 
administration of hypocretin might be more suitable for physiological studies. 

The cell bodies that produce the hypocretins are located in an area 
implicated in ablation studies as regulatory centers for appetitive behaviors, 

15 suggesting that the hypocretins may serve as a major transmitters for the central 
system signalling the status of energy balance in the major fat repositories. The 
projections of hypocretin-producing cells indicate that the peptides function both 
within the hypothalamus and at a complex and diffuse network of targets in 
several regions of the brain that may coordinate the various aspects of appetitive 

20 behavior, adaptive thermogenesis and metabolic regulation. 

Rat hypothalamus from 18-day embryos was cultured for 10 days in vitro . 
The mediobasal hypothalamus was removed from embryonic day 18 Sprague 
Dawley rats. The tissue was enzymatically digested in a mild protease solution 
(10 U/ml papain and 0.2 mg/ml L-cysteine in Earle's balanced salt solution) for 

25 30 minutes. Next, the tissue was pelleted, and the protease solution was 
removed. Tissue was then suspended in standard tissue culture medium 
(glutamate- and glutamate-free DMEM supplemented with 10% fetal bovine 
serum, 100 U/ml penicillin/streptomycin, and 6 gm/1 glucose) and then triturated 
into a single-cell suspension. Cells were washed and pelleted an additional three 

30 times. The single-cell suspension was plated onto 22 mm 2 glass coverslips that 
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had been coated with high-molecular-weight (540,000 Da) poly-D-lysine . High- 
density cultures (200,000/cm 2 ) were used for all experiments. Hypothalamic 
neural cultures were maintained in a Napco 3600 incubator (37 degree Celsius 
and 5% C0 2 ) until they were ready for use. To limit non-neuronal cell 
5 proliferation cytosine arabinofuranoside (1 /*M) was added to the tissue culture 
medium 1 day after plating. 

Synaptically coupled hypothalamic neurons were recorded in voltage clamp 
with a whole cell pipette (holding potential = -60mV), This recording is typical 
of 9 of 12 cells examined under these conditions. The frequency of postsynaptic 

10 events (PSCs) was greatly increased (up to +400%) by 1/xM hcrt2 applied to the 
bath. After washout of the peptide, the frequency of PSCs returned to normal 
baseline levels. Inset boxes show higher resolution of the events indicated by the 
dotted line. Both boxes (in Fig. 7 A and B) were recorded with an identical 
delay after hcrt2 administration. A less mature hypothalamic neuron after 4 days 

15 of culture was unresponsive to 1/xM hcrt2 (Fig. 7B). Pipette solution contained 
128 mM KMeS0 4 , 27 mM KC1, 0.4 mM EGTA, 1 mM ATP, and 0.5 mM GTP. 

The preceding written description provides a full, clear, concise 
and exact disclosure of the invention so as to enable one skilled in the art to make 
and use the same. This disclosure should not be construed so as to impart any 

20 direct or implied limitation upon the scope of the invention which is particularly 
pointed out and distinctly claimed below. 
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Table 1 Cumulative Data From 100 Clones 





Clone* 


BLAST Homology" 


Accession^ 


# d 


Pattern" 


5 


2 + 


oxytocin 


M25649 


13 


A/A 




6 + 


VAT1-like 


T05306 


11 


B/B 




1 + 


CART 


U10071 


7 


C 




35 + 


novel 




6 


A/A 




15 + 


novel 




4 


B/C 


10 


25 + 


POMC 


J00759 


4 


A 




12 + 


novel(E) 


R75926 


3 


B/B 




16 + 


vasopressin 


M25646 


3 


A 




18 + 


glutathione perox 


U 13705 


3 


B 




29 + 


novel CaM kinase 




3 


B/C 


15 


3 + 


novel 




2 


B/C 




10 + 


novel 




2 


B/B 




51 + 


ubiquitin carrier 


M91679 


2 


C 




62 + 


novel 




2 


C 




5 - 


calbindin 


U08290 




B 


20 


14 + 


melanin-cone hormone 


M62641 




C 




17 + 


asp aminotrans 


M 18467 




D 




19 - 


novel(E) 


R74893 




D 




20 + 


novel 






A/B 




21 - 


novel(E) 


T32756 




A/D 


25 


22 - 


novel 






D 




33 + 


novel(E) 


R67552 




A/A 




34 - 


CI7HC0 3 ' exchanger 


J05167 




C 




37 + 


novel 






B/D 




39 - 


novel 






C 


30 


45 + 


novel 






C 




46 + 


fibromodulin 


X82152 




C 




47 


perox enolhydratase 


U08976 




C 




48 + 


galanin 


J03624 




B 




52 - 


5-HT 2 receptor 


L31546 




B 


35 


53 + 


MHC orf 


M32010 




E 




55 + 


HNF dimer cofactor 


M83740 




C 




56 + 


carbonyl reductase 


X84349 




C 




57 + 


tyrosine hydroxylase 


M 10244 




A 




63 + 


novel 






D 


40 


67 + 


novel 






B/B 




73 + 


novel 






C 




74 + 


novel(E) 


T93996 




C 




75 + 


lamin C2 


D14850 




A 




86 + 


novel 






C 


45 


92 - 


novel(E) 


R49544 




C 




98 + 


novel 






B/D 




99- 


neuronal kinesin 


U06698 




B/D 



8 number of prototype clone in set of 1 00 followed by indication (+/-) as to whether 3' sequence 
50 contained poly(A)-addition hexad (no 3' sequence for clone 47) 

b short name of matching species or novel for no match:(E) indicates EST match 

c GenBank database reference 

d number of representatives in set of 100 

• hybridization pattern in cDNA library Southern assay/Northern blot assay. Code:A, target only; 
55 B, target highly enriched; C, hypothalamus and hippocampus; D, not highly enriched; E, too 

faint to categorize 
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SEQUENCE LISTING 



(1) GENERAL. INFORMATION: 

(i) APPLICANT: Sutcliffe, J. Gregor 

Gautvik, Kaare M. 
De Lecea, Luis 
Bloom, Floyd E. 
Danielson, Patria E. 
Kilduff, T.S. 
Gautvik, Vigdis T . 
Foye, Pamela E. 

(ii) TITLE OF INVENTION: Hypothalamus -Specific Polypeptides 
(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Olson & Hierl , Ltd. 

(B) STREET: 20 North Wacker Drive, 36th Floor 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP : 60606 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 01-AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 6 0/023,22 0 

(B) FILING DATE: 02-AUG-1996 

( C ) CLAS S I F I CAT ION : 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Arne M. Olson 

(B) REGISTRATION NUMBER: 3 0,2 03 

(C) REFERENCE/DOCKET NUMBER: 548 . IP 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-580-1180 

(B) TELEFAX: 312-580-1189 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Asn Leu Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 
20 25 30 

Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 
35 40 45 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
50 55 60 

Leu Gly Lys Arg Arg Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin 
65 70 75 " 80 

Arg Leu Leu Gin Ala Asn Gly Asn His Ala Ala Gly lie Leu Thr Met 
85 90 95 

Gly Arg Arg Ala Gly Ala Glu Leu Glu Pro Tyr Pro Cys Pro Gly Arg 
100 105 110 

Arg Cys Pro Thr Ala Thr Ala Thr Ala Leu Ala Pro Arg Gly Gly Ser 
115 120 125 

Arg Val 
130 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Asn Phe Pro Ser Thr Lys Val Pro Trp Ala Ala Val Thr Leu Leu 
15 10 15 

Leu Leu Leu Leu Leu Pro Pro Ala Leu Leu Ser Leu Gly Val Asp Ala 
20 25 30 

Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr Cys Ser Cys Arg Leu 
35 ^40 45 

Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie Leu Thr 
50 55 60 
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Leu Gly Lys Arg Arg Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin 
65 J 70 75 80 

Arg Leu Leu Gin Ala Asn Gly Asn His Ala Ala Gly lie Leu Thr Met 
8 5 90 95 

Gly Arg Arg Ala Gly Ala Glu Leu Glu Pro His Pro Cys Ser Gly Arg 
100 105 110 



Gly Cys Pro Thr Val Thr Thr Thr Ala Leu Ala Pro Arg Gly Gly Ser 
115 120 125 

Gly Val 
130 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 56 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TAAGACGACG GCCTCAGACT CCTTGGGTAT TTGGACCACT GCACCGAAGA TACCATCTCT 6 0 

CCGGATTACC TCTCCCTGAG CTCCAGACAC CATGAACCTT CCTTCTACAA AGGTTCCCTG 120 

GGCCGCCGTG ACGCTGCTGC TGCTGCTACT GCTGCCGCCG GCGCTGCTGT CGCTTGGGGT 18 0 

GGACGCGCAG CCTCTGCCCG ACTGCTGTCG CCAGAAGACG TGTTCCTGCC GGCTCTACGA 240 

ACTGTTGCAC GGAGCTGGCA ACCACGCCGC GGGCATCCTC ACTCTGGGAA AGCGGCGACC 3 00 

TGGACCCCCA GGCCTCCAAG GACGGCTGCA GCGCCTCCTT CAGGCCAACG GTAAC CACGC 3 60 

AGCTGGCATC CTGACCATGG GCCGCCGCGC AGGCGCAGAG CTAGAGC CAT ATCCCTGCCC 42 0 

TGGTCGCCGC TGTCCGACTG CAACCGCCAC CGCTTTAGCG CCCCGGGGCG GATCCAGAGT 480 

CTGAACCCGT CTTCTATCCC TGTCCTAGTC CTAACTTTCC CCTCTCCTCG CCAGTCCCTA 540 

GGCAATAAAG ACGTTTCTCT GTTGGTGTG 569 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TAAGACGACG GCCTCAGACT TCTTGGGTAT TTGGACCACT GCACTGAAGA GATCATCTCT 60 

CCAGATTACT TTCCCCTGAG CTCCAGGCAC CATGAACTTT CCTTCTACAA AGGTTCCCTG 12 0 

GGCCGCCGTG ACGCTGCTGC TGCTGCTACT GCTGCCACCG GCGCTGCTGT CGCTTGGGGT 180 

GGACGCACAG CCTCTGCCCG ACTGCTGTCG CCAGAAGACG TGTTCCTGCC GTCTCTACGA 24 0 

ACTGTTGCAC GGAGCTGGCA ACCACGCTGC GGGTATCCTG ACTCTGGGAA AGCGGCGGCC 3 00 

TGGACCTCCA GGCCTCCAGG GACGGCTGCA GCGCCTCCTT CAGG CCAACG GTAACCACGC 360 

AGCTGGCATC CTGACCATGG GCCGCCGCGC AGGCGCAGAG CTAGAGCCAC ATCCCTGCTC 420 

TGGTCGCGGC TGTCCGACCG TAACTATCAC CGCTTTAGCA CCCCGGGGAG GGTCCGGAGT 480 

TTGAAC C CAT CTTCTATCCT TGTCCTGATC CAAACTTCCC CCTCTGCTCG CCGCTGTCAG 540 

TCTCTTGGTA AATGGCAATA AAGACGTTTC TCTGTTGGTG TG 582 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GCTAGGAGAC ATTGCGGCGG CGGTGGCGGC GTTGGCAGCA GCTGCAGACA TGCTGCTGCT 6 0 

CAAGAAACAG ACGGAGGACA TCAGCAGTGT CTATGAGATC CGGGAGAAGC TGGGCTCGGG 12 0 

TGCCTTCTCT GAGGTGATGC TGGCCCAGGA AAGGGGCTCT GCTCATCTTG TGGCCCTCAA 18 0 

GTGCATTCCC AAGAAAG C AC TTCGGGGCAA GGAGGCCCTG GTGGAGAATG AGATCGCAGT 240 

ACTCCGCAGG ATTAGCCACC CCAACATTGT GGCTCTGGAG GACGTCCACG AGAGCCCTTC 300 

CCATCTCTAC TTGGCCATGG AGCTGGTAAC AGGTGGTGAA CTGTTTGACC GAATCATGGA 36 0 

GCGGGGCTCC TACACAGAGA AGGATGCGAG CCACCTTGTA GGGCAGGTCC TTGGTGCTGT 42 0 
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CTCCTACCTT 


CATAGCCTGG 


GCATCGTGCA 


CCGGGACCTC 


AAGCCTGAAA 


ACCTCCTCTA 


480 


TGCCACACCT 


TTTGAGGACT 


CCAAGATCAT 


GGTCTCTGAC 


TTTGGCCTGT 


CCAAAATTCA 


540 


AGCTGGCAAC 


ATGCTAGGCA 


CAGCCTGTGG 


GACCCCAGGA 


TATGTGGCCC 


CAGAGCTCCT 


600 


GGAG CAGAAA 


CCCTACGGGA 


AGGCCGTAGA 


TGTGTGGGCC 


CTGGGTGTCA 


TCTCCTACAT 


660 


CCTGCTGTGT 


GGGTACCCCC 


CCTTCTATGA 


TGAGAGCGAT 


CCTGAACTCT 


TCAGCCAGAT 


720 


TCTGAGGGCC 


AGCTACGAGT 


TTGACTCTCC 


CTTTTGGGAT 


G AC AT CT C AG 


AATCAGCCAA 


780 


AGACTTCATT 


CGGCACCTTC 


TGGAACGTGA 


TCCCCAGAAG 


AGGTTCACCT 


GCCAACAGGC 


840 


CTTACAGCAT 


CTCTGGATCT 


CTGGGGATGC 


AGCCTTGGAC 


AGGGACATCC 


TAGGTTCTGT 


900 


CAGTGAGCAG 


ATCCAGAAGA 


ATTTTGCCAG 


GACCCACTGG 


AAGCGTGCAT 


TCAATGCCAC 


960 


ATCATTCCTA 


CGTCACATCC 


GTAAGCTGGG 


ACAGAGCCCA 


GAGGGTGAGG 


AGGCCTCCAG 


1020 


GCAGGGTATG 


ACCCGTCACA 


GCCACCCAGG 


CCTTGGGACT 


AGCCAGTCTC 


CCAAGTGGTG 


1080 


ACAACCAGGT 


GGATGCCAAG 


GAAGGCCAAG 


TGGACTGACT 


CCTAGCTTTT 


CTTTCCTCCA 


1140 


GC(-V_ 111 1<J.M. 




TGATCCTTGT 


CCCCCGGACT 


GGCCTCTGTT 


GGAAAGTCCA 


1200 


AGACCGTGGG 


TGTGATGCAT 


GGCACTGGGG 


TATGGGGCTT 


CCCAAGTATG 


TCCCCAGCCT 


1260 


CTGTCCTTTG 


TTGCTGCCAC 


CCTCTATGGA 


AACTGAGGAG 


GTATTCAAAA 


ATGGATTTGG 


1320 


GGGCCATGCT 


TCCTGCACCT 


TGCACGCACA 


TATGCATTGC 


GTGGCTGTTC 


TGTGCTTTGC 


1380 


TGACTGTGGG 


TGGTCCTGCT 


TGTGTTGTAG 


CCCTTTAGTT 


CCTCCTCTTT 


CCAACCAATA 


1440 


AAGACAAACA 


GAACAATG 










1458 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNES S : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANT I - SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Leu Gly Val Asp Ala Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr 
! 5 10 15 

Cys Ser Cys Arg Leu Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala 
20 25 30 

Ala Gly He Leu Thr Leu Gly Lys Arg Arg Pro Gly Pro Pro Gly Leu 
35 40 45 
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Gin Gly Arg Leu Gin Arg Leu Leu Gin Ala Asn Gly Asn His Ala Ala 
50 ~ 55 60 

Gly lie Leu Thr Met Gly Arg Arg Ala Gly Ala Glu Leu Glu Pro Tyr 
65 70 75 80 

Pro Cys Pro Gly Arg Arg Cys Pro Thr Ala Thr Ala Thr Ala Leu Ala 
85 90 95 

Pro Arg Gly Gly Ser Arg Val 
100 

(2) INFORMATION FOR SEQ ID NO ; 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Leu Gly Val Asp Ala Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr 

1 1 ' 5 10 15 

Cys Ser Cys Arg Leu Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala 

2 0 25 30 

Ala Gly lie Leu Thr Leu Gly 
35 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Leu Gly Val Asp Ala Gin Pro Leu Pro Asp Cys Cys Arg Gin Lys Thr 

1 " 5 10 15 

Cys Ser Cys Arg Leu Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala 

20 25 30 

Ala Gly lie Leu Thr Leu 
35 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANT I - SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin Ala 
15 10 15 

Asn Gly Asn His Ala Ala Gly He Leu Thr Met Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI - SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Pro Gly Pro Pro Gly Leu Gin Gly Arg Leu Gin Arg Leu Leu Gin Ala 
15 10 15 

Asn Gly Asn His Ala Ala Gly He Leu Thr Met 
20 25 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Leu Tyr Glu Leu Leu His Gly Ala Gly Asn His Ala Ala Gly lie 
1 5 10 15 

Leu Thr Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Arg Leu Gin Arg Leu Leu Gin Ala Asn Gly Asn His Ala Ala Gly lie 
15 10 15 

Leu Thr Met Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI - SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Asn His Ala Ala Gly lie Leu Thr 
1 5 
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We claim: 

1 . An isolated polynucleotide selected from the group 

consisting of: 

(a) a polynucleotide encoding the polypeptide comprising the amino 
5 acid sequence of SEQ ID NO:l; 

(b) a polynucleotide encoding the polypeptide comprising the amino 
acid sequence of SEQ ID NO:2; 

(c) a polynucleotide capable of hybridizing to and which is at least 
95% homologous to the polynucleotide of (a) or (b). 

10 2. The polynucleotide of claim 1 comprising the sequence of 

SEQ ID NO:3. 

3. The polynucleotide of claim 1 comprising the sequence of 
SEQ ID NO:4. 

4. An isolated polynucleotide selected from the group 

15 consisting of: 

(a) a polynucleotide encoding a polypeptide comprising the 
sequence of SEQ ID NO:6; 

(b) a polynucleotide encoding a polypeptide comprising amino 
acids 28 to 130 of SEQ ID NO:2; 

20 (c) a polynucleotide capable of hybridizing to the polynucleotide of 

(a) ; 

(d) a polynucleotide capable of hybridizing to the polynucleotide of 

(b) ; 

(e) a polynucleotide that is at least 95 % homologous to the 
25 polynucleotide of (a); and 

(0 a polynucleotide that is at least 95 % homologous to the 
polynucleotide of (b). 

5. An isolated polynucleotide selected from the group 

consisting of : 

30 (a) a polynucleotide encoding a polypeptide comprising the 
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sequence of (SEQ ID NO:7); 

(b) a polynucleotide encoding a polypeptide comprising the 
sequence of (SEQ ID NO: 8); 

(c) a polynucleotide encoding a polypeptide comprising amino acids 
5 42 to 66 of (SEQ ID NO:l); 

(d) a polynucleotide encoding a polypeptide comprising amino 
acids 42 to 65 of (SEQ ID NO:l); 

(e) a polynucleotide encoding a polypeptide comprising amino 
acids 43 to 66 of (SEQ ID NO:l); 

10 (f) a polynucleotide encoding a polypeptide comprising amino acids 

43 to 65 of (SEQ ID NO:l); and 

(g) a polynucleotide capable of hybridizing to and which is at least 
95% homologous to a polynucleotide of (a) through (f). 

6. An isolated polynucleotide selected from the group 

15 consisting of: 

(a) a polynucleotide encoding a polypeptide comprising the 
sequence of (SEQ ID NO:9); 

(b) a polynucleotide encoding a polypeptide comprising the 
sequence of (SEQ ID NO: 10); and 

20 (c) a polynucleotide capable of hybridizing to and which is at least 

95% homologous to the polynucleotide of (a) or (b). 

7. An isolated polynucleotide selected from the group 

consisting of: 

(a) a polynucleotide encoding a polypeptide comprising amino acids 
25 100 to 130 of (SEQ ID NO:l); 

(b) a polynucleotide encoding a polypeptide comprising amino 
acids 100 to 130 of (SEQ ID NO:2); 

(c) a polynucleotide encoding a polypeptide comprising amino acids 
100 to 111 of (SEQ ID NO:l); 

30 (d) a polynucleotide encoding a polypeptide comprising amino 
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So 

acids 100 to 110 of (SEQ ID NO:l); 

(e) a polynucleotide encoding a polypeptide comprising amino acids 
114 to 130 of (SEQ ID NO:l); and 

(f) a polynucleotide capable of hybridizing to and which is at least 
5 95% homologous to a polynucleotide of (a) through (e). 

8. An isolated polypeptide selected from the group consisting 
of the amino acid sequence of SEQ ID NO:l and the amide thereof. 

9. An isolated polypeptide selected from the group consisting 
of the amino acid sequence of SEQ ID NO: 2 and the amide thereof. 

10 10. An isolated polypeptide selected from the group consisting 

of a polypeptide comprising the sequence of SEQ ID NO: 6, a polypeptide 
comprising amino acids 28 to 130 of SEQ ID NO:2, and the amides thereof. 

11. An isolated polypeptide comprising the sequence of 
SEQ ID NO: 13. 

15 12. The isolated peptide of claim 11 wherein the isolated peptide 

is from about 28 amino acid residues to about 39 amino acid residues long. 

13. An isolated polypeptide selected from the group consisting 

of: 

(a) a polypeptide comprising the sequence of SEQ ID NO:7; 
20 (b) a polypeptide comprising the sequence of SEQ ID NO:7; 

(c) a polypeptide comprising amino acids 42 to 66 of SEQ ID 

NO:l; 

(d) a polypeptide comprising amino acids 42 to 65 of SEQ ID 

NO:l; 

25 (e) a polypeptide comprising amino acids 43 to 66 of SEQ ID 



NO:l; 
NO:l; 



(f) a polypeptide comprising amino acids 43 to 65 of SEQ ID 



(g) a polypeptide comprising at least one conservative amino acid 
30 substitution in the sequence of polypeptides (a - f); and 



WO 98/05352 



PCT7US97/13657 



of: 



NO:l); 
NO:l); 



(h) the amides thereof. 

12. An isolated polypeptide selected from the group consisting 



(a) a polypeptide comprising the sequence of SEQ ID NO: 9; 
5 (b) a polypeptide comprising the sequence of SEQ ID NO: 10; 

and 

(c) the amides thereof. 

13. An isolated polypeptide selected from the group consisting 

of: 

10 (a) a polypeptide comprising amino acids 100 to 130 of (SEQ ID 

NO:l); 

(b) a polypeptide comprising amino acids 100 to 130 of (SEQ ID 
NO:2); (c) a polypeptide comprising amino acids 100 to 111 of (SEQ ID 
NO:l); 

15 (d) a polypeptide comprising amino acids 100 to 110 of (SEQ ID 



(e) a polypeptide comprising amino acids 114 to 130 of (SEQ ID 



(f) a polypeptide comprising at least one conservative amino acid 
20 substitution in the sequence of polypeptides (a - e); and 

(g) the amides thereof. 



14. A vector comprising a polynucleotide of claim 1 operably 
linked to control sequences which direct the expression of the polynucleotide. 
25 15. A vector comprising a polynucleotide of claim 2 operably 

linked to control sequences which direct the expression of the polynucleotide. 

16. A vector comprising a polynucleotide of claim 3 operably 
linked to control sequences which direct the expression of the polynucleotide. 

17. A vector comprising a polynucleotide of claim 4 operably 
30 linked to control sequences which direct the expression of the polynucleotide. 
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18. A vector comprising a polynucleotide of claim 5 operably 
linked to control sequences which direct the expression of the polynucleotide. 

19. A vector comprising a polynucleotide of claim 6 operably 
linked to control sequences which direct the expression of the polynucleotide. 

5 20. A vector comprising a polynucleotide of claim 7 operably 

linked to control sequences which direct the expression of the polynucleotide. 

21. A host cell transformed with a vector of claim 14. 

22. A host cell transformed with a vector of claim 15. 

23. A host cell transformed with a vector of claim 16. 
10 24. A host cell transformed with a vector of claim 17. 

25. A host cell transformed with a vector of claim 18. 

26. A host cell transformed with a vector of claim 19. 

27. A host cell transformed with a vector of claim 20. 

28. A pharmaceutical composition comprising a polypeptide of 
15 claim 8 and a pharmaceutical^ acceptable carrier. 

29. A pharmaceutical composition comprising a polypeptide of 
claim 9 and a pharmaceutical^ acceptable carrier. 

30. A pharmaceutical composition comprising a polypeptide of 
claim 10 and a pharmaceutically acceptable carrier. 

20 3 1 . A pharmaceutical composition comprising a polypeptide of 

claim 1 1 and a pharmaceutically acceptable carrier. 

32. A pharmaceutical composition comprising a polypeptide of 
claim 12 and a pharmaceutically acceptable carrier. 

33. A pharmaceutical composition comprising a polypeptide of 
25 claim 13 and a pharmaceutically acceptable carrier. 

34. A method of treating a neurological disease or homeostatic 
dysfunction or controlling the production of a homeostatic regulatory hormone 
comprising introducing an effective amount of the composition of claim 16 into a 
mammal in need of such treatment. 

30 35. An antibody that immunoreacts with an isolated mammalian 
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^3 

H35 protein. 

36. An antibody of claim 35 which is a monoclonal antibody. 

37. A kit for detecting the presence of an H35 protein in a 
mammalian sample comprising an antibody which immunoreacts with a 

5 mammalian H35 protein or with a polypeptide of claim 10 in an amount sufficient 
for at least one assay and suitable packaging material. 

38. A kit for detecting the presence of an H35 protein in a 
mammalian sample comprising an antibody which immunoreacts with a 
mammalian H35 protein or with a polypeptide of claim 11 in an amount sufficient 

10 for at least one assay and suitable packaging material. 

39. A kit for detecting the presence of an H35 protein in a 
mammalian sample comprising an antibody which immunoreacts with a 
mammalian H35 protein or with a polypeptide of claim 12 in an amount sufficient 
for at least one assay and suitable packaging material. 

15 40. A kit for detecting the presence of an H35 protein in a 

mammalian sample comprising an antibody which immunoreacts with a 
mammalian H35 protein or with a polypeptide of claim 13 in an amount sufficient 
for at least one assay and suitable packaging material. 

41. The kit of claim 37 further comprising a detecting antibody 
20 which binds to the anti-H35 antibody. 

42. The kit of claim 41 wherein the detecting antibody is 

labeled. 

43. The kit of claim 42 wherein the label comprises enzymes, 
radioisotopes, fluorescent compounds, colloidal metals, chemiluminescent 

25 compounds, phosphorescent compounds, or bioluminescent compounds. 

44. A kit for detecting the presence of genes encoding an H35 
protein comprising a polynucleotide of claim 1 , or fragment thereof having at 
least 10 contiguous bases, in an amount sufficient for at least one assay, and 
suitable packaging material. 

30 45. A method for detecting the presence of a nucleic acid 
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encoding an H35 protein in a mammalian sample, comprising the steps of: 

(a) hybridizing a polynucleotide of claim 1, or fragment thereof 
having at least 10 contiguous bases, with the nucleic acid of the sample; and 

(b) detecting the presence of the hybridization product. 

5 46. A method of detecting an H35 antigen in a mammalian 

sample comprising the steps of: 

(a) contacting the sample with an anti-H35 antibody which 
immunoreacts with a hypocretin polypeptide; and 

(b) detecting the presence of an immunoreaction complex. 

10 47. The method of claim 46 wherein said complex is detected 

by admixing said immunoreaction complex with a detecting antibody capable of 
immunoreacting with said anti-H35 antibody. 

48. The method of claim 47 wherein the detecting antibody is 

labeled. 

15 49. The method of claim 46 wherein the anti-H35 antibody is 

immobilized on a solid support. 

50. The method of claim 46 wherein the sample comprises cells. 

51. The method of 50 wherein the cells are peripheral blood 
mononuclear cells. 

20 52. The method of claim 46 wherein the immunoreaction 

complex of step (b) is detected by flow cytometry. 

53. The method of claim 46 wherein the immunoreaction 
complex of step (b) is detected by ELISA. 

54. A method of claim 46 wherein the immunoreaction complex 
25 of step (b) is detected by immunoblot analysis. 

55. A polynucleotide comprising the sequence of SEQ ID NO: 5. 

56. A vector comprising the sequence of SEQ ID NO:5. 

57. A host cell transfected with the vector of claim 56. 
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Fig. 4 
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M N L/F P S T K V 
ATG AAC CTT CCT TCT ACA AAG GTT 
ATG AAC TTT CCT TCT ACA AAG GTT 
* 

CTG CTG CTA CTG CTG CCG CCG GCG 
CTG CTG CTA CTG CTG CCG CCG GCG 

QPLPDCCR 
CAG CCT CTG CCC GAC TGC TGT CGC 
CAG CCT CTG CCC GAC TGC TGT CGC 

YELLHGAG 
TAC GAA CTG TTG CAC GGA GCT GGC 
TAC GAA CTG TTG CAC GGA GCT GGC 

LGKSKPGP 
CTG GGA AAG CGG CGA CCT GGA CCC 
CTG GGA AAG CGG CGG CCT GGA CCT 

RLLQANGN 
CGC CTC CTT CAG GCC AAC GGT AAC 
CGC CTC CTT CAG GCC AAC GGT AAC 

GJ2HAGAEL 
GGC CGC CGC GCA GGC GCA GAG CTA 
GGC CGC CGC GCA GGC GCA GAG CTA 

JR/G C P T A/V T A/T T 

CGC TGT CCG ACT GCA ACC GCC ACC 

GGC TGT CCG ACC GTA ACT ACC ACC 

* * * * * 
R/G V 

AGA GTC TGA 
GGA GTC TGA 



PWAAVTLL 
CCC TGG GCC GCC GTG ACG CTG CTG 
CCC TGG GCC GCC GTG ACG CTG CTG 

LLSLGVDA 
CTG CTG TCG CTT GGG GTG GAC GCG 
CTG CTG TCG CTT GGG GTG GAC GCA 

■*• 

QKTCSCRL 
CAG AAG ACG TGT TCC TGC CGT CTC 
CAG AAG ACG TGT TCC TGC CGT CTC 

NHAAGI LT 
AAC CAC GCC GCG GGC ATC CTC ACT 
AAC CAC GCT GCG GGT ATC CTG ACT 

* ★ * 

PGLQGRLQ 
CCA GGC CTC CAA GGA CGG CTG CAG 
CCA GGC CTC CAG GGA CGG CTG CAG 
* 

HAAGILTM 
CAC GCA GCT GGC ATC CTG ACC ATG 
CAC GCA GCT GGC ATC CTG ACC ATG 

E P Y/H P C P/S G R 
GAG CCA TAT CCC TGC CCT GGT CGC 
GAG CCA CAT CCC TGC TCT GGT CGC 

+ * 
ALAPRGGS 
GCT TTA GCG CCC CGG GGC GGA TCC 
GCT TTA GCA CCC CGG GGA GGG TCC 

* * * 



B 



consensus- RL LL GNHAAGILT G 

hcrtl • LGVDAQPLPDCCRQKTCSCRLYELLHGAGNHAAGILTLG 

hcrt2 " PGPPGLQGRLQRLLQANGNHAAGILTMG 

SECRETIN : HSDGTFTSKLSRLRDSARLQRLLQGLV HSDGTFTSK 

----- ********* ***** 
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GCTAGGAGACATTGCGGCGGCGGTGGCGGCGTTGGCAGCAGCTGCAGACATGCTGCTGCT 

"~ u « ■» ^ ^ — — — — — — — — +• — — — — — — — — — +. — — ~ „ — — — ■. — + — — — — — — — — — + 60 

1 CGATCCTCTGTAACGCCGCCGCCACCGCCGCAACCGTCGTCGACGTCTGTACGACGACGA 

M I* L L 

CAAGAAACAGACGGAGGACATCAGCAGTGTCTATGAGATCCGGGAGAAGCTGGGCTCGGG 

_ _ - — — _ — — - — — — — + •» — — — — — — — — + — — — — — ~ — — — + ~ — — — — — — *- — + 12 0 

61 G TTCTT TGTCT G CCT CCTG T AGTC GTCACAGATACT CT AGGCC CT CTT C GAC C CG AG C CC 
X X QTEDISSVYEIREKLGSG 

TGCCTTCTCTGAGGTGATGCTGGCCCAGGAAAGGGGCTCTGCTCATCTTGTGGCCCTCAA 

12 i + -- -+ + + + + 180 

ACGGAAGAGACTCCACTACGACCGGGTCCTTTCCCCGAGACGAGTAGAACACCGGGAGTT 

AFSEVMLAQERGSAHLVALK 

GTGCATTCCCAAGAAAGCACTTCGGGGCAAGGAGGCCCTGGTGGAGAATGAGATCGCAGT 

+ + --- + - + - - + 240 

181 CACGTAAGGGTTCTTTCGTGAAGCCCCGTTCCTCCGGGACCACCTCTTACTCTAGCGTCA 
CIPKKALRGKEALVENEIAV 

ACTCCGCAGGATTAGCCACCCCAACATTGTGGCTCTGGAGGACGTCCACGAGAGCCCTTC ^ 

241 TGAGGCGTCCTAATCGGTGGGGTTGTAACACCGAGACCTCCTGCAGGTGCTCTCGGGi^ 
LRRISHPNIVALEDVHESPS 

CCATCTCTACTTGGCCATGGAGCTGGTAACAGGTGGTGAACTGTTTGACCGAATCATGGA ^ 

301 GGTAGAGATGAACCGGTACCTCGACCATTC 

HLYLAMELVTGGELFDRIME 

GCGGGGCTCCT ACACAGAGAAGGATGCGAGCCACCTTGTAGGGCAGGTCCTTGGTGCTGT ^ 

361 cGCCCCGAGGATC^ 

RGSYTEKDASHLVGQVLGAV 



421 



CTCCTACCTTCATAGCCTGGGCATCGTGCACCGGGACCTCAAGCCTGAAAACCTCCTCTA 

+ + + + + + 480 

GAGGATGGAAGTATCGGACCCGTAGCACGTGGCCCTGGAGTTCGGACTTTTGGAGGAGAT 

SYLHSLGIVHRDLKPENLLY 



TGCCACACCTTTTGAGGACTCCAAGATCATGGTCTCTGACTTTGGCCTGTCCAAAATTCA ^ 

481 ACGGTGTGGAAAACTCCTGAGGTTCT 

ATPFEDSKlMVSDFGIiSKlQ 



Fig. 6 



WO 98/05352 



9/11 



PCT/US97/13657 



AGCTGGCAACATGCTAGGCACAGCCTGTGGGACCCCAGGATATGTGGCCCCAGAGCTCCT 



541 + + + + + + 600 

TCGACCGTTGTACGATCCGTGTCGGACACCCTGGGGTCCTATACACCGGGGTCTCGAGGA 
AGNMLGTACGTPGYVAPELL 

GGAGCAGAAACCCTACGGGAAGGCCGTAGATGTGTGGGCCCTGGGTGTCATCTCCTACAT 

601 - + - + + + + + 660 

CCTCGTCTTTGGGATGCCCTTCCGGCATCTACACACCCGGGACCCACAGTAGAGGATGTA 
EQKPYGKAVDVWALGVISYI 

CCTGCTGTGTGGGTACCCCCCCTTCTATGATGAGAGCGATCCTGAACTCTTCAGCCAGAT 

661 + + + ---+ + + 720 

GGACGACACACCCATGGGGGGGAAGATACTACTCTCGCTAGGACTTGAGAAGTCGGTCTA 
LLCGYPPFYDESDPELFSQI 

TCTGAGGGCCAGCTACGAGTTTGACTCTCCCTTTTGGGATGACATCTCAGAATCAGCCAA 

721 + + + + + + 780 

AGACTCCCGGTCGATGCTCAAACTGAGAGGGAAAACCCTACTGTAGAGTCTTAGTCGGTT 
LRASYEFDSPFWDDISESAK 

AGACTTCATTCGGCACCTTCTGGAACGTGATCCCCAGAAGAGGTTCACCTGCCAACAGGC 

781 + + + + + + 840 

TCTGAAGTAAGCCGTGGAAGACCTTGCACTAGGGGTCTTCTCCAAGTGGACGGTTGTCCG 
DFIRHLIiERDPQKRFTCQQA 

CTTACAGCATCTCTGGATCTCTGGGGATGCAGCCTTGGACAGGGACATCCTAGGTTCTGT 

841 + + + + + + 900 

GAATGTCGTAGAGACCTAGAGACCCCTACGTCGGAACCTGTCCCTGTAGGATCCAAGACA 
LQHLWISGDAAIiDRDILGSV 

C AGTG AG CAG AT C C AGAAGAATTTTGCCAGG ACCC A CTGG AAG CGTG CATTC AATG CCAC 

901 + + + + + + 960 

GTCACTCGTCTAGGTCTTCTTAAAACGGTCCTGGGTGACCTTCGCACGTAAGTTACGGTG 
SEQIQKNFARTHWKRAFNAT 

ATCATTCCTACGTCACATCCGTAAGCTGGGACAGAGCCCAGAGGGTGAGGAGGCCTCCAG 

961 + + + + + + 1020 

TAGTAAGGATGCAGTGTAGGCATTCGACCCTGTCTCGGGTCTCCCACTCCTCCGGAGGTC 
SFLRHIRKLGQSPEGEEASR 

GCAGGGTATGACCCGTCACAGCCACCCAGGCCTTGGGACTAGCCAGTCTCCCAAGTGGTG 

1021 + + + + + + 1080 

CGTCCCATACTGGGCAGTGTCGGTGGGTCCGGAACCCTGATCGGTCAGAGGGTTCACCAC 
QGMTRHSHPGIiGTSQS PKWV 
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ACAACCAGGTGGATGCCAA.GGAAGGCCAAQTGGACTGACTCCTAGCTTTTCTTTCCTCCA 

«„»—*.— — — — — — — — — + — ————-———+— — — — — — — ——+—————————+ X 1 4 0 

1081 TGTTGGTC^CCTACGGTTCCTTCCGGTTCACCTGACTGAGGATCG 
TTRWMPRKAKWTDS 

GCCCTTTTGATCTCCTTCCCTGATCCTTGTCCCCCGGACTGGCCTCTGTTGGAAAGTCCA ^ ^ ^ 
1141 CGGGAAAACTAGAGGAAGGGACTAGGAACAGGG 

AGACCGTGGGTGTGATGCATGGCACTGGGGTATGGGGCTTCCCAAGTATGTCCCCAGCCT 

_ — — 4. - — - — — - — — - + —- — —-—-—-+-——-—-—--+— - - - — — — ** - + 12oU 
1201 TCTGGCACCCACACTACGTACCGTGACCCCATACCCCGAAGGGTTCATACAGGG^ 

CTGTCCTTTGTTGCTGCCACCCTCTATGGAAACTGAGGAGGTATTCAAAAATGGATTTGG ^ ^ 
1261 GACAGGAAACAACGACGGTGGGAGATACCTTTGACTCCTCCATJ^ 

GGGCCATCCTTCCTGCACCTTGCACGCACATATGCATTGCGTGGCTGTTCTGTGCTTTGC 
1321 CCCTOTAGGAAGGACGTGGAACGTGCGTGTATACG 

TGACTGTGGGTGGTCCTGCTTGTGTTGTAGCCCTTTAGTTCCTCCTCTTTCCAACCAATA 
1381 ACTGACACCCACC^GGACGAACACA 



AAGACAAACAGAACAATG 
TTCTGTTTGTCTTGTTAC 



1441 - — ► 1458 
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